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Abstract 

In the repeated Prisoner's Dilemma, when every player has a dif- 
ferent discount factor, the grim-trigger strategy is an equilibrium if 
and only if the discount factor of each player is higher than some 
threshold. What happens if the players have incomplete information 
regarding the discount factors? In this work we look at repeated games 
in which each player has incomplete information regarding the other 
player's discount factor, and ask when a pair of grim-trigger strategies 
is an equilibrium. We provide necessary and sufficient conditions for 
such strategies to be an equilibrium. We characterize the states of 
the world in which the strategies are not triggered, i.e., the players 
cooperate, in such equilibria (or e-equilibria) , and ask whether these 
"cooperation events" are close to those in the complete information 
case, when the information is "almost" complete, in several senses. 
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More than Two Players 



1 Introduction 



In the repeated Prisoner's Dilemma, when every player has a different dis- 
count factor, the grim-trigger strategy is an equilibrium if and only if the 
discount factor of each player is sufficiently close to 1. A similar situation 
holds for other repeated two-player games in which there is a pair of pure ac- 
tions r = (ti, T2) such that under r the payoff for each player is strictly higher 
than some equilibrium payoff: there are two thresholds A°, such that the 
grim-trigger course of action r* = (rj*, ), under which the players follows r 
until a deviation occur, and then they switch to the equilibrium action that 
punishes the other player, is an equilibrium if and only if Aj > A° for i = 1,2. 
In a symmetric game, like the Prisoner's Dilemma, these thresholds are the 
same for both playersE 

In this work we look at repeated games in which each player has in- 
complete information regarding the other player's discount factor, and ask 
when a pair of grim-trigger strategies is an equilibrium. A strategy in the 
incomplete information game is information- dependent: a strategy assigns a 
"course of action" , which is a strategy in the repeated game (with complete 
information), to each state of the world. A pair of strategies will be called 
"conditional-grim-trigger" if it is composed of two action pairs a = (o"i,cr2) 
and r = (ri, T2), where (a) cr is an equilibrium of the one-shot game, (b) the 
payoff under r is higher than the payoff under a for both players, and (c) in 
any state of the world u, player i either plays repeatedly cij, or plays until 
a deviation from r is detected, and then he switches to playing cTj forever. 



^Note that when the players have different discount factors, there may exist equihbria 
which yield payoffs that are higher than the payoffs under any such r*. Such cooperative 
equilibria are not in the scope of this work. See Lehrer and Pauzner (1999). 
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for i = 1,2. 

In order for a player to cooperate (assuming his own discount factor is 
sufficiently high), he needs to ascribe high enough probability that the other 
player's discount factor is higher than his threshold. But he also needs to 
ascribe high enough probability that the other player ascribes high enough 
probability that his own discount factor is higher than his own threshold, 
and so on. We thus get that infinitely many conditions need to hold in order 
for the conditional-grim-trigger strategy pair to be an equilibrium, one for 
each level of belief for each player. Note that the "high enough" probability 
in each level depends on the player's own discount factor, since the higher his 
discount factor is, the player will lose more if the grim-trigger course of action 
is triggered. Therefore, a player with high discount factor will cooperate in 
situations where he wouldn't have cooperated if his discount factor was lower. 

We show that this is not the case, and only two conditions for each player 

are necessary and sufficient to characterize when the conditional-grim-trigger 

strategy is a Bayesian equilibrium. In this strategy, each player plays the 

grim-trigger course of action in some states of the world, which is called his 

"cooperation event" , and the punishing strategy in the others. We show that 

this strategy is a Bayesian equilibrium if and only if (a) each player plays 

the grim-trigger course of action only when his discount factor is above his 

threshold; and (b) each player ascribes sufficiently high probability to the 

other player's cooperation event whenever he plays the grim-trigger course 

of action, and a sufficiently low probability to the other player's cooperation 

^This is a narrower sense of the concept of grim-trigger strategy, since we demand that 
the pair of punishing strategies wih define an equihbrium, instead of any pair a = (cti, (72) 
under which max^' Ui{<T[,aj) < Ui(r) (Here and below, i is an arbitrary player and j is 
the player which is not i). Because of this assumption, a conditional-grim-trigger strategy 
pair Bayesian equilibrium is a perfect Bayesian equilibrium. 



6 



event whenever he does not play this course of action. This result holds for 
all belief structures, whether they are derived from a common prior or not. 

We describe the sets of states of the world that satisfy these conditions, 
and relate them to the concept of /-belief and common-/-belief. These two 
concepts generalize the concepts of p-belief and common-p-belief defined by 
Samet and Mondrer (1989). In particular, wc show that for the repeated Pris- 
oner's Dilemma, the strategy profile above is an equilibrium whenever each 
player plays the cooperation strategy if he /-believes that it is a common- 
/-behef that both players' discount factors are above a given threshold. For 
games with more then two actions for each player, an additional condition is 
needed. 

We also show that these conditions arc sufficient, though not necessary, for 
this kind of strategy to be a Bayesian equilibrium even if each player does not 
know his own discount factor, and also in a larger class of two-player games 
with incomplete information, in which there is an equilibrium which holds 
in all states of the nature (equivalent to the "punishment" equilibrium) and 
an equilibrium which holds only in some states of nature (equivalent to the 
"cooperation" grim-trigger equilibrium, which holds only for high discount 
factors). We also show that similar conditions are sufficient, though not 
necessary, in repeated games with incomplete information with more than 
two players. 

Last, we look at belief spaces in which the information is "almost" com- 
plete, in several senses, and see that in some senses, when the information is 
almost complete, there is a conditional-grim-trigger e-equilibrium in which 
the players cooperate in all states of the world where they could have cooper- 
ated under equilibrium in the complete information case, but for a set of small 
probability, while in other senses, there may be no conditional-grim-trigger 
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e-equilibrium at all. 

2 The Model 

Let r = (A^, {Ai)i(zN, (Mj)jgAr) be a two-player one-shot game: = {1,2} is 
the set of players, and for every player i ^ N, Ai is the set of pure actions 
of player i, and Ui is his utility function (extended multilinearly to mixed 
strategies). Let cr = [ai,a2) be a Nash equilibrium in mixed-strategies in F. 
Assume that the payments in another non-equilibrium pure action profile r = 
(ti,T2), which we call a cooperation profile, are higher than the equilibrium 
payments, that is Uii^r) > Ui{a) for i = 1,2. Also, assume that for i = 1,2, 
Ti is not a best response to aj, and, in particular, not in the supporjfl of CTj. 

Let G = {N, {S,S),Il, {Ai)i^]\r, (Mj)ieAr) be the repeated game based on F, 
with incomplete information regarding the discount factors: 

• iV = {1, 2} is the set of players. 

• {S, S) is a measurable space of the states of nature, that is, S C [0, 1)^ 
is the set of possible pairs of discount factors of the players. S is the 
(T-algebra that is induced on S by the Borel a-algebra on [0, 1)^. 

• n = {Q, E, A, (-Pi)igAr) is the players' belief space: 

— {Q, S) is a measurable space of states of the world. 

— X : Q S is a measurable function between the states of the 
world and the states of nature, i.e., the players' discount factors 
are \{uj) = (Xi^u), A2(w)), where Xi^ou) is player i's discount factor 

■^We discuss this assumption in 13.51 below. 
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in the state of the world oj. The definition of S imphes that 
{uj I \{uj) e B} eT., for every open B C [0, 1)^. 

— : r2 — 7- A{Q) is a measurable function that assigns a belief for 
player i to each state of the world u. We denote by Pi{E \ u) the 
probability that player i ascribes to the event E C Q at the state 
of the world u, and by Ei{- \ u) the corresponding expectation 
operator. Pj is measurable in the sense that for every E C 
Pi{E I ■) is a measurable function. Pi is consistent, in the sense 
that each player knows his belief: Pi{{u}' : Pi{u}) = Pi{u')} \ 
u) = 1, for every u E Q. We assume that Pi is such that each 
player knows his own discount factor in every state of the world 
u: Pi{{u' : \i{ijj') = \i{u)} \ u) = 1 for every u E Q. In Section 
19.11 we drop this assumption. 

Because Pi is consistent, it divides Q into disjoint "types" of player z; 
that is, into the sets {{w' | Piiu') = Pi{ijj)},ijj G Denote by Sj C E 
the cr-algebra generated by these sets, and by the sets {u \ Xi{uj) G B}, 
for every open set B C [0, 1) (since each player knows his own discount 
factor, these sets are unions of his types). We will call a subset of Q or 
a function from Q i-measurable if it is measurable with respect to Sj. 

• Ai and Ui are the same as in F: the set of pure actions of player i, and 
his utility function (in each stage of the game), which are independent 
of the state of the world. 

A course of action of player i is function that assigns a mixed action 
of player i to each finite history of actions in the game. This function is 
independent of the state of the world. A strategy of player i is an i-measurable 
function rji that assigns a course of action r]i{u}) to each state of the world u. 
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The payoff of player i when the profile {rji^rjj) is played, conditional on the 
state of the world w, is 'yi{rii,rij \ u) = Ki^Y'^uj \ u), where m* is 

the utility in stage t. The expectatioro depends on u, since player j's actions 
and player i's belief may depend on u. As mentioned, we assume that the 
discount factor Aj is known to player i. 

Recall that r = (ri, T2) is a cooperative profile, and a = {ai, (72) is a Nash 
equilibrium with payoffs lower than those under r. 

Definition 2.1 A grim-trigger course of action for player i, based on a 
and T, is the course of action r* under which player i plays Ti until player j 
deviates from tj, and from that stage on plays cTi. 

In the complete information case, there are thresholds A^, A2 such that r* is 
an equilibrium if and only if Aj > A^: 

A° := min |a. | ^ - {u^, r,) + «.(^) > Va,' ^ r,} . 

Denote Ai = {u E Q \ Xi{u}) > A°}, which is the set of states in which player 
i cannot profit by deviating from the profile r*. Note that player i always 
knows whether Xi{uj) > A° or not: Aj is an i-measurable event. 
We are interested only in "conditional-grim-trigger" strategies: 

Definition 2.2 A strategy rji of player i is called conditional-grim-trigger 

strategy (with respect to a and r) if there is an i-measurable set Ki Cfl such 
that: 

^^^^^ = ^ * ^ 



''Note that this is actually the expected subjective payoff of player i, because the expec- 
tation is taken with respect to his belief Pi(aj). This is the relevant payoff for the player's 
decision making. One could define the payoff as 7,; — Ei{{l — Ai(w)) ^ii'^Y^^'^l I '^)- 

Most of our results remain unchanged with this payoff function. 
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Here a* is the course of action where player i always plays ai. We denote 
this strategy by r)*{Ki). 

Note that if a pair of grim-trigger strategies rf{Ki, K2) — {rjl{Ki),r)2{K2)) 
is played, then, because is not in (Ji's support, after the first stage both 
players learn if the other player "cooperates" . Prom that point on they do 
not learn anything else. 

Definition 2.3 If r)*{Ki, K2) is a Bayesian equilibrium, the pair of events 
{Ki,K2) is called cooperation events. 

The existence of non-empty cooperation events guarantees that the players 
may cooperate in some states of the world. 

Note that r/*(0,0) is a Bayesian equilibrium in which the players always 
follow a. Also note that if ri*{Ki, K2) is a Bayesian equilibrium and Ki = 0, 
then Kj = 0, since we assumed that Tj is not a best response to a^. 

3 Main Result: Characterization of the Co- 
operation Events 

3.1 The Complete Information Case 

When the game has complete information we can identify S and fl, and each 
player knows the true state of the world. 

Theorem 3.1 When the game G has complete information, ri*{Ki, K2) is 
a Bayesian equilibrium if and only if Ki = K2 Q A where A = Ai n A2 = 
{Xk> XI fork ^1,2). 
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Proof: Assume Ki — K2 Q A. Then for every u & Ki, the profile r* is 
played, and both players know it. Since e A, both \i> \\ and A2 > A", 
so neither player can profit by deviating. For every u ^ Ki, the profile a* 
is played, and both players know it. Since o" is a Nash equilibrium in F, 
neither player can profit by deviating. For the opposite direction, there are 
two cases: Ki = K2 ^ A and Ki ^ K2- Assume first that K\ = (f- ^1 
and let uj e Ki/K. Without loss of generality assume that Ai < A? in the 
state of the world u. In the state of the world u the profile r* is played, 
but since Ai < A°, player 1 can profit by deviating, so rf{K-i,K2) is not a 
Bayesian equilibrium. Assume Now that u e K1/K2. Then in the state of 
the world cu, player 1 knows that player 2 plays (T2, and since t\ is not a best 
response to (72, it is profitable for him to deviate from r^* to cx^, and therefore 
ri*{Ki,K2) is not a Bayesian equilibrium. □ 

Example 3.2 Prisoner's Dilemma with complete information. 



D C 



1,1 


4,0 


0,4 


3,3 



Here a — {D, D) and r = (C, C). It can be easily calculated that Xl — X2 — 
1/3, and that r]*{Ki, K2) is a Bayesian equilibrium if and only if Ki — K2 Q 
{Ai,A2>l/3}. 

3.2 The Incomplete Information Case 

Our main result is the following: 

Theorem 3.3 Let Ki C Q be an i measurable event for i = 1,2, then the 
strategy profile r]*{Ki,K2) — {r)l{Ki),r)2{K2)) is a Bayesian equilibrium if 
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and only if ,for i = 1,2, Ki C Ai and 

1. Pi{Kj I u) > fi{uj) for every u e Ki, 

2. Pi{Kj I u) < gi{uj) for every u ^ Ki, 
for the i-measurahle functions^ 

f (u) •= max ' «»(cr^,o-j)-«,(r,,(Tj) 

and 

g,{uj) ■.= mm{glg^{u),gf{uj)}, 

where cr^ is an action of player i, and Fi = {a'i \ Uiiji^aj) < Ui{a'i,aj)}. 
g} , gf{uj) and gf{uj) are derived from different kinds of deviations, and are 
defined by: 

1 . Ui{a) - Uiia'i, aj) 

g- := mm 



ai€Hl {ui{a) - Ui{a'-, aj)) + {ui{a'i, Tj) - Ui{(r„ r,)) 
where Hi := {a'i ^ cri,Ti \ Ui{cri,Tj) < Ui{cr'-,Tj)}, 

9i [^) ■— 7 J— ^— - — Y 

{ui{a) - u,{Ti, aj)) + (^T^gj) - (uiiffi, t^) + Ui{a)j^^^)j 

whenever Ui{ai,Tj) + Ui{a) (^jzy^^ ^ ( i-a "(i) ) ' '^''^'^ 9i{^) '■— ^ otherwise, 
and 

where 

Hf{uj) := {a- ^ Ti I u,{t) - Ui{ai, Tj) + {ui{a'i, Tj) - {ui{a))) \i{uj) < 0} . 



^By definition, the infinium over an empty set is 1 and the supremum over an empty 
set is 0. 
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We now present an equivalent formulation of Theorem I3.3[ using the 
concept of /-belief and common- /-belief, which we define now. 

Definition 3.4 Let N be a set of players and {Q, S, A, (Pj)jgAr) be a general 
belief space on a measurable space of states of nature {S, S). Let f : — t- 
be a measurable function, where fi is i-measurable for every i & N. Let 
A ^ Q be an event. We say that player i /-believes in the event A aX u if 
Pi{A I u) > fi{uj)- We say that the event A is an common- /-belief at u if 
in the state of the world u each player f -believes in A, f -believes that each 
other player f -believes in A, f -believes that each other player f -believes that 
each player f -believes in A, etc. 

These two concepts generalize the concepts of p-belief and common-p-belief 
described by Samet and Mondrer (1989). These concepts are discussed in 
detail in Section HI 

The definitions imply that the conditions in Theorem 13.31 are equivalent 
to the following: 

1. Kj is an /-belief in Ki. 

2. Kj is an (1 — 5f)-belief in A'f . 

In SectionlHwe prove that these conditions are also equivalent to the following 
condition: each player either /-believes that Ki fl is a common- /-belief 
or (1 — (7)-believes that the event "i^i fl K2 is not a common-/-belief" is a 
common-(l — 5f)-belief (Theorem 14.61) . 

3.3 Proof of Theorem [3731 

Player i's payoff under the strategy profile r* is 7i(T*) = y3a^' nien- 
tioned before, the strategy profile r is not an equilibrium in the one-shot 
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game, whereas a is. If player i deviates from the profile r* and plays a[ ^ Ti 
in the first stage, then from the second stage on player j will play aj, to 
which (Tj will be player i's best response. The expected payoff for player i 
will then be Ui{a[, tj) +Ui{a) ^t'^^^^^ ■ If there is a profitable deviation from r* 
for player i, there is such a profitable deviation in the first stage. Therefore 
A° := min |Aj | — {ui{a[,Tj) + Ui{(7)-^^) > Vo-^' 7^ rj|, is the minimal 
discount factor Aj such that player i cannot gain by deviating from the profile 

T*. 

We will now check player i's options to deviate from the profile rf{Ki, K2). 
Case 1: u! e Kj,. 

Note that, because Ki is i-measurable, in this case Pi{Ki \ u) — 1. 

Player i's payoff under the strategy profile ri*{Ki, Kj) is 

^,{ri*{K,,Kj)\u)^ 

mj I ^) (1^) + (1 - mj I ^)) a,) + u,{a)^) . 

Indeed, according to player i's belief, with probability Pi{Kj \ cu) player j 
plays T*, so they will play r at every stage and his payoff will be jzx^^ 
with probabihty (1 — Pi{Kj \ u)) player j plays aj so in the first stage the 
profile played will be (tj, aj) and afterwards the players will play a, so that 
player i's payoff will be Ui{Ti, aj) + Uiicr)j^^^^. 

We now check the conditions that guarantee that player i cannot profit 
by deviating: 

• Because a is an equilibrium, deviation after the first stage can be prof- 
itable for player i only if player j played tj in the first stage. Also, in 
that case, if it's profitable to deviate in stage > 2, it is also profitable 
to deviate in stage k = 2, because when rj*{Ki, K2) is played, the play- 
ers do not learn anything from the second stage onwards. After the 
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first deviation the best response to r]*{Kj) is to play (7j in all stages, 
because player j will play aj. Denote by a'** the course of action in 
which player i plays in the first stage, an if player j played tj in 
the first stage, player i plays a pure action (t[ ^ Ti in stage 2 and 0"^ 
afterwards. If player j played aj in the first stage, player i plays CTj 
from the second stage onwards. The payoff is: 

= Pi{Kj I uj) (uiir) + Ui{a'i, Tj)Xi{uj) + Ui{a)^^^^ + 
+ (1 - F^iK, I cO) (u.(t:„ a,) + u4a)j^) . 
So that rj*{Ki,K2) is an equilibrium we should have '-)i{r]*{Ki,K2) \ 
i^) > lii.'^'i* I ^) s-ll 7^ 'Til ^iid therefore, 

for every (t[ ^ Tj. Because Aj > 0, either Pi{Kj | a;) = or a; e Aj. 

Player i can also deviate in the first stage to a pure strategy a'i ^ Ti. 
As before, after the deviation his best response to i]* is to play Oi in all 
stages. His payoff in this case is (denote by a* the course of action of 
player i that plays u\ ^ Ti at stage 1 and Ui thereafter): 

+ {l-P,{K,\u)){ui{a',,a,)+u,{a)-^^^^. 
In order for r]*[Ki, K2) to be an equilibrium we should have 
^i{r]*{Ki,K2) I u) > ^i{a'*,rj*{Kj) \ oj), and therefore, for every cr- 7^ 



Ti.: 



+ {l-Pi{Kj\uj)){ui{Ti,aj)-Ui{a'„aj))>0. (1) 
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Because Xj is not a best response to aj, Pi{Kj \ u) > 0, otherwise 
inequality ([T]) becomes Ui{Ti,aj) — Ui{a[,aj) > for every a- 7^ Tj in 
contradiction. As above we deduce, that u G Aj, or Ki C Aj. There- 
fore we get that ^^'y^^-^ — {ui{a[,Tj) + Ui{o)-^^^^j^) > 0. Therefore, if 
Ui{Ti,aj) > Ui{a[,aj) inequahty d] trivially holds. Otherwise we obtain: 



Ui{a'^,aj)-Ui{Ti,a-j) 



or Pi{Kj I u) > fiiu) where fiiu) is defined by: 



where Fj = {a- | Ui{Ti,aj) < Mj(a., cTj)}. 
Case 2: u ^ Ki. 

Player i's payoff under the strategy profile ri*{Ki, K2) is 

• Player i can deviate to the strategy a'* described above (Uj may be in 
the support of a. See remark 1331 1)). His payoff in this case is: 

+ {l-P,{K,\u;))(u,{a'„a,)+u,{a)- ^'^"^^ 



1 - X^iuJ) 

So that ri*{Ki,K2) is a Bayesian equilibrium, we should have 
^i{r]*{K^,K2) I uj) > ^i{a'*,7]*{Kj) \ w), that is 

Pi{Kj I u;)((ui((Tj,rj) - nj(o-j,rj)) - (nj(o-) - Uiialaj))) + 

+ iu,ia)-u,{a'^,a,))>0. (2) 
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The last term on the left side of inequality is non-negative since a is 
an equilibrium. Therefore, if Mj((Tj, Xj) > Ui{a[,Tj) inequality H] trivially 
holds. Inequality ([2]) is equivalent to Pi{Kj \ u) < gj, where 



1 . Ui[a) - Ui[a-,aj, 

91 ■■= mm^ 



and Hi := {a^ ^ cri,Ti \ Ui{ai,Tj) < Ui{al,Tj)}. Note that gj is inde- 
pendent of u and the inequality is independent of Xi{uj). 

Player i can deviate to r*. In this case his payoff is: 

I — Ai{U) 



CO 



Because '^,{r]*{Ki, K2) \ u) > -fi{T* ,r]*{Kj) 

-(ui(o-) - Ui{ri, aj))) + {ui{a) - Ui^Ti, aj)) > 0. (3) 



As before, Ui{a) — Ui{Ti,aj) > 0. Unless Ui{ai,Tj) + Ui{a) (^-^ 



1-Ai(a;) 



Xi (ui) ^ 

< inequality ([3]) trivially holds. Therefore inequality ([3]) is 
equivalent to Pi{Kj \ u) < gfi^cj), where 

2 _ Ui{a) - Ui{Ti,Orj) 
9i \CJ) :— y — — — r- 

whenever Ui{ai,Tj) + Ui{a) (izij^) < (t^§J))' and gf{uj) := 1 oth- 
erwise. 

Player i can play the following strategy a'J': 
— Play Tj in the first stage. 
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— If j played Tj in the first stage, play Tj until stage k + 1 and then 
play a pure a'^ 7^ r^, and afterwards cxj. 

— If j played aj in the first stage, player i's best response is cxj from 
stage 2 onwards. 

The payoff is: 
7.(af,r/;(ir,) l^) = 

Note that •ji^a'j'' , rj* \ u) = 74(Aj(a;))^ + B for some constants A and 
B, independent of k. Therefore, as a function of k, ■ji^a'^ ,ri*{Kj) \ u) 
is monotonic. If it is increasing, it is smaller than the payoff in the 
previous case (playing r*). If it is decreasing, 'yi{ri*{Ki, K2) \ uj) > 
'~^i{a'^ ,rj*{Kj) I (jj) if and only if the inequality holds for k = 1. There- 
fore, we only need to consider the case of = 1: 

Pi{Kj I u) {ui{Ti, aj) - Mj(cr) + Ui{ai, Tj) - Uj(r) + 

+ {Ui{(7) - Ui{a[, Tj)){\i{u))) + {Ui{(j) - Ui{Ti, CTj)) > 0. (4) 

If Ui{T)—Ui{(7i,Tj) + {ui{al,Tj) — {ui{a)))\i{u) > inequality (jl]) trivially 
holds. Therefore inequality (jlj) is equivalent to Pi{Kj \ u) < gf{ijj), 
where 

and 

Hf{uj) := {a'- ^ Ti I Mi(r) - Ui{ai, Tj) + {ui{a[, tj) - {ui{a))) \i{uj) < 0} . 
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Then gi{oj) := mm{gl, gf{u), gf^u)}. Last, we need to prove that fi and 
gi are i measurable. Both of them are rational functions (or segment-wise 
rational functions) of Aj, and therefore, as functions of Aj, they are Borel 
functions. Since Sj contains {u \ \i{uj) E B}, for every open set B C [0, 1), 
fi and gi, as functions of u, are i-measurable. □ 

Remark 3.5 1. If Ti is in the support of ai, player i cannot lose by play- 
ing Ti in every u: Ti is a best response to aj in this case, and player j 
cannot (in this mechanism) discern the deviation and punish. There- 
fore, in this case, in order for r]*{Ki, K2) to be an equilibrium, it is 
necessary that Pi{Kj \ u) = for every u ^ Ki. Also, in this case Ki 
is not necessarily a subset of Aj — see next remark. 

2. If Ti is a best response to aj, Ki may not be a subset of Ai since we 
cannot conclude from inequality (QP that Pi{Kj \ u) > 0, and therefore 
we only get that Ki C Aj U | Pi{Kj \ u) = 0}. Moreover, in 
this case (and this case only) inequality (CP trivially holds for every 
G Aj U {w I Pi{Kj I cj) = 0} — there is no non-trivial condition on 
Pi{Kj I u) that needs to hold for u G Ki. Also, in this case gi{uj) may be 

even if Ti isn't in the support of ai because gf{u}) and gf{uj) are either 

1 or (this follows from their definition because Ui{a) = Ui{Ti, aj) ). See 
Example \3. 7[ 

Remark 3.6 1- If there is a best response a[ to cxj which is a better re- 
sponse to Tj than ai, then gi = (and then we need Pi{Kj \ u) = for 
every u ^ Ki), even if Ti is a best response to aj, because then g] = 0. 
Note that this condition depends only on the structure ofT, and not on 
the information structure. From this we deduce that if ai is not pure, 
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then gi = unless Ui{a^,Tj) = Ui{a'/,Tj) for every <J^,a'/ in the support 
ofai. 

2. Under the assumption that Ui{Ti,(Ti) < 'Uj(cr), we have fi > 0. Note 
that fi{uj) < 1 for u G Aj, and it can be defined as needed outside this 
set, see the following remark. 

3. Note that the definition of fi outside Ki and the definition of gi inside 
Ki for i = 1,2 is irrelevant for Theorem \3.3[ 

In the following example we show that if Xj is a best response to aj, 
7]*{Ki, K2) may be an equilibrium even if Ki D Ai (a strict inclusion). In 
particular, the assumption that is not a best response to aj is necessary 
for Theorem 13.31 

Example 3.7 Consider the following game: 



D C 



1,1 


4,1 


1,4 


3,3 



As m ExampleWM = {D,D), r = {C,C), and A° = = 1/3 (the 
payoff of player i when he plays C and player j plays D is irrelevant for 
the calculation of X^, as long as {C,C) remains a Nash equilibrium). Let 
Q = {1/4,3/4}^, with = , \2{ou)) = u. In any state of the world 

u, player i knows only the coordinate i of u (his own discount factor). If 
Ui = 1/4, player i believes thatu = (1/4, 1/4) (i.e., that both discount factors 
are 1/4)- //"t^i = 3/4, player i believes that uoj = 1/4 or Uj = 3/4 with equal 
probability. From A? = = 1/3 we have that Ai = {3/4} x {1/4,3/4} 
and A2 = {1/4,3/4} x {3/4}. We now show that ri*{Q, A2) is a Bayesian 
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equilibrium: Player 1 plays t* in every state of the world, and player 2 
plays T2 if 0U2 = 3/4, and always defects otherwise. Therefore, if uJi = 1/4, 
player 1 believes that U2 = 1/4:, so that player 2 always plays D, and in this 
case player 1 gains 1 in each stage regardless of his actions. If uji = 3/4, 
player 1 believes that in probability 1/2 player 2 always plays D, so the same 
argument as before holds, and in probability 1/2 player 2 plays r|. Because 
uji = 3/4 > player 1 cannot profit from deviating from t* . Therefore, 
player 1 cannot profit by deviation. Player 2 knows that player 1 plays Ti. 
If UJ2 = 3/4 he plays T2, and since 3/4 > A2 he cannot profit from deviating. 
If ^2 = 1/4, player 2 plays D in every stage. He cannot profit from playing 
T2, since 1/A < A^. Last, as shown in the proof to Theorem \3.3[ we need to 
check that he cannot profit by deviating to and indeed inequality holds 
in this case (for a2 = (J2 = D). Therefore, 'r]*{Q,A2) is indeed a Bayesian 
equilibrium. 

4 Constructing the Cooperation Events 

In this section we construct the cooperation events Ki and K2 so they will 
satisfy the conditions of Theorem 13.31 

4.1 Preliminaries — Belief Operators 

In this section we study the concepts /-belief and common-/-belief, as de- 
fined in Definition 13.41 This subsection is valid for general belief spaces, not 
only regarding the model of incomplete information regarding the discount 
factors (see definition 10.1 in Zamir-Maschler-Solan for a formal definition of 
a general belief space). 
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Definition 4.1 The /-belief operator of player i is the operator Bj : E — > 
E that assigns to each event the states of the world at which player i f -believes 
in the event: B( {A) := {uj ^ \ Pi{A \ u) > fi{u)}. 

If fi is a constant function, and fi — fj for every i and j, then the concept 
of /-behef reduces to the concept of p-behef of Mondrer and Samet (1989). 
If fi is a constant function pi, but not necessarily pi = pj for every i and j, 
then the concept of /-belief reduces to the concept of p-belief of Morris and 
Kajii (1997). Similar to the analysis of Mondrer and Samet, we prove the 
following: 

Proposition 4.2 Let B{ be an f -belief operator. The following hold: 

1. IfA,BeJ: and ACB, then B{{A) C {B). 

2. If A e E then b{{b{{A)) = b{{A). 

3. If {An)'^=i is a decreasing sequence of events then B({f]'^^^An) = 

4- IfC eT, is an i-measurahle event, then B( (C) = {C\{u} G fl \ fiioj) > 
l})U{ujen\ fi{u) < 0}. 

5. Iffi>0 or {fi < 0} C C then B{{A)nC = B{ (AdC) for every event 
A e E and i-measurable event C e E. 

Proof: The proof of parts 1 and 3 is similar to the proof for Proposition 
2 in Mondrer and Samet (for p-behef operators). To prove part 4, observe 

that for every G C, Pi{C \ u) = 1 and for every w ^ C, Pi{C \ oo) = 
0. Part 2 follows from part 4, because for every A G S, B( [A) is an i- 
measurable event (since fi and Pi{A \ •) are i-measurable), which contains 
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{u E Q \ fiiu)) < 0}. To prove part 5, assume {/j < 0} C C. In this 
case B{{C) = C \ {ft > 1}. Assume oo G B{ (AnC). From part 1 we 
have u G B{ (A) and u G B({C) C C. For the opposite direction, assume 
u G bI {A) n C. Then Pi{A \ uj) > fi{u) and Pi{C \ co) = 1 and therefore 
Pi{A n C I w) > /.(w), that is, u G 5/ (A n C). □ 

We say that C is a common- f -belief at a state of the world u if in the 
state of the world u each player /-believes C, /-believes that the other 
players /-believe C, /-believes that they /-believe that he /-believes C, 
etc. Therefore, C is a common- /-belief at u, then u G Bf (C), u G 

f]j^^j^ Bl{f\-^j^ bI (C)), and so on. In particular, if we define D°(C) := C, 
D-+\C) := {'\.^j,B({D'\C)) for every n > 0, and Df{C) := n„>i^"(C) 
we get that "C is a common- /-belief is is equivalent to w G D^{C). The 
event {C) is called C is a common- f -belief. 

By Proposition 14.21 B( is a belief operator as defined by Mondrer and 
Samet (1989), and therefore we get: 

Proposition 4.3 Let C eT., u e Vt. The following definition of "C is an 
common- f- belief at a state of the world u" is equivalent to the Definition's^: 
C is a common- f -belief at u if and only if there exist an event D E T, such 
that u e D, and D C b( {C) and D C B({D) for every i G A^. 

4.2 Constructing the Cooperation Events 

Now we go back to the incomplete information structure as defined in Section 

El 

Definition 4.4 For every two events Ci e T., i = 1, 2, define D]'^{Ci, Cj) := 
bI (Cj) nCi. Forn>l, define 

C,) := 5/ iD]'''^{C„ a)) n Dr^'^iQ, C,), and 
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The definition of D{ {Ci, Cj) is similar (tliough not identical) to the defi- 
nition of iterated p -belief of player i in Morris (1999). 

As the next Lemma states, D{{Ci,C2) and D{{C2,Ci) are the largest 
subsets of Ci and C2 (respectively), such that the first inequality in Theorem 

holds: 

Lemma 4.5 1. For every to E D{{Ci,Cj), one has P,{D^j{Cj,Q) \ u) > 
fi{uj) for i = 1,2. D{ (Ci, C2) and D2 (C2, Ci) are the largest subsets of 
Ci and C2 (respectively) such that this property holds^ 

2. If Ci is i-measurable so is D{ {Ci,Cj), for every Cj eT.. 

3. If Ci is i-measurable and {fi < 0} C d fori = 1,2, then D^'-^ {Ci,Cj) 
and D{ (Ci,Cj) depend only on the intersection of Ci and Cj. In this 
case, D({Ci, C2) H dI{C2, Ci) = D^{Ci fl C2), the event containing all 
state of the world u such that Ci H C2 is a common- f -belief at u, and 
D{ (Ci, Cj) = bJ {D^{Ci n C2)), the event event containing all state of 
the world 00 such that player i f -believes that Ci fl C2 is a common- f- 
belief. 

Note that part 3 holds when /i > and /2 > 0, and in particular to 
p-belief for p E (0, 1). 
Proof: 

^They are the largest in the following (strong) sense: if Ki C Ci and K2 C C2 fulfill 
Vcj e Ki P,{Kj I u) > f,{uj) for i = 1, 2 then K, C d{ (C^, Q). 
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1. We first argue that C B- (Dj). Indeed, by Proposition 14.2( 3). and 
since (-Dj''^)^i is a decreasing sequence of events, 

\n>l / n>l n>l 

= n ^r'' = D(. 

n>l 

Here D{ = D{{Ci,Cj) and Df'^ = Df'^{Ci,Cj). For the maximahty 
property, assume that that Ki C Ci and ^^^2 ^ C2 fulfill Pi{Kj \ u) > 
fiiuj) for every u E Ki and z = 1,2. Therefore C i?^^ (/C,) from 
which follows Dl'^{Ki,Kj) = Ki and therefore D{{Ki,Kj) = Ki. Be- 
cause Ki C Cj it follows, from Proposition 14.2( 1) that D{{Ki,Kj) C 
D{{Q,C,). 

2. This follows from the fact that Bf (C) is z-measurable for every C G S, 
and because the intersection of countably many z-measurable sets is 
i-measurable. 

3. Denote C := Ci H C2. By Proposition |12l5) one has D}{Ci,Cj) := 
B{{C). This proves the first claim. Because D][Ci,Cj) := Bf{C), 
it can be verified from the definition of that for n > 1, D"'{C) = 
D5^(Ci, C2)nD^(C2, Ci) and therefore Df{C) = D({Ci, C2)nDl{C2, Ci). 
The event "player i /-believes that Ci fl C2 is a common- /-belief" is 
the event 

B({Df{C)) = B( {D({C,, C2) n Di{C2, O) = 

= B{iD,{C„ Q)) n D{{Q, C,) = D{{Q, C,). 

The second equality follows from the second part of this lemma and 
Proposition 14.2( 5). and the last one from the first part of this lemma. 
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□ 

Using Lemma [4.51 and Proposition 13.31 we get the following result: 



Theorem 4.6 For every two events Ci and C2, such that Ci C Aj, for i = 
1, 2, and Ci is measurable according to the information of player i, the strat- 
egy profile rj*{B({D^ {Ci nC2), B({D^{Ci nC2))) is a Bayesian equilibrium if 
and only H C2))" = fi- "^(^^-^(^/(^^(Ci H C2))" H n 

C2)y)). Moreover, {B{ {D^ {Ci n C2M= 

^ are the maximal subsets of {Ci)f^^ 
with this property - if there is i such that B{ {D^{Ci H C2)) C Ki C d, then 
ri*{Ki, K2) is not an equilibrium. 

The opposite is also true: ifr]*{Ki, K2) is an equilibrium, then B{ {Df {Kif] 
Kj)) = Ki and Bl'^D^-a^Kf n K'^)) = Kf. 

In other words, r]*{Ki, K2) is an equilibrium if and only if each player 
either /-believes at u that Kir\K2 is a common- /-belief at a; or 1 — ^f-believes 
at u that the event "f^i nfr2 is not a common- /-belief" is a common-(l —g)- 
belief at u, for every state of the world u E Q. 

Also, if Ki = Bl{Df{Ci n C2)) {i = 1,2) for some Ki C Q C A^, then 
each player either /-believes at uj that Ci fl C2 is a common- /-belief at u or 
1 — (yf-believes at u that the event "Ci fl C2 is not a common- /-belief" is a 
common-(l — g)-helief at u, for every state of the world u E Q. 

This theorem shows that the argument that infinite number of conditions 
of the type "each player needs to ascribe high enough probability that the 
other player ascribes high enough probability that his own discount factor 
is higher than..." are needed, as was discussed in the introduction, holds. 
It is equivalent to the condition that in Ki player i /-believes Kj at u for 
every u E Ki for both i = 1 and i = 2. Also, as the theorem states, a similar 
condition is required for every u ^ Kj. 
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Proof: Denote C = Cir\C2- From Remark l3.6( 3). we can assure that {/» < 
0} C D{{Ci,Cj) and {1 - < 0} C D{{Ci,CjY. Therefore, Lemma 1131^3) 
holds, so D{{Q,Cj) = B{{Df{C)) and dI-'{D{ {Q,C,Y, Df^{Cj,aY) = 
B]-\D'-^{Bl{Df{C)rnBf^{Df{C)r)). Because A^(C.,C,) = 
we know from Lemma H751 (l) that the first condition in Proposition 13.31 holds, 
and that there are no larger subsets of Ci,C2 such that it holds. Therefore, 
ri*{B({D-f (C), B({D-f (C))) is an equilibrium if and only if the second con- 
dition holds. That is, for every u i B{{Df{C)) one has Pi{Bj{D^C)) \ 
< Qii^) which is equivalent to Pi{Bj{D^{C)y \ uj) > 1 — gi{uj) for ev- 
ery u G Bl{Df{C)Y or D{{Q,C,y = B^{Df{C)Y C 5^^(dJ(C„ C,)^) = 
B]'\B^- {Df{C)Y). From that we get 

Bl{Df{C)r = D{{a, C,r = Dl^'-%D{ (a, C,Y, C^Y), 
for i = 1,2, which implies that 

D{ (a, c,Y = D-''-\D{{c,, c,Y, D\[c,, aY), 

so that 

D{{a, C,Y = D]-\Dl{C,, C,Y. Dl[C„ C,Y) = 

= B]-\D'-^{Bl{Df{C)Y n 5/ {Df{C)Y)). 

For the second part, observe that Bf {D-^{Ki Cl Kj)) = Ki follows from 
Lemma WMl)- B{ {Df{Ki n Kj)) = D( (Ki, Kj) is the biggest subset of Ki 
such the first condition in Proposition 13.31 holds. But since t]*{Ki, K2) is an 
equilibrium it holds for Ki. Similarly, B]'\D^-3{KfnK'r)) = Dj^^K^, K^) 
is the biggest subset of K^ such that the second condition in Proposition 13.31 
holds, and therefore Bl~\D^-3{Kf n K'^)) = Kf. □ 

Example 4.7 Recall that Aj = {Aj(-) > A°}. So, if we take Ci = K, then 
r7*(Ai, A2) is an equilibrium if and only if whenever a player does not f -believe 
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that "Ai and A2 are high enough" is a common- f -belief , he (1 — g)-believes 
that the fact that this is not a common- f- belief is a common-{l — g)-belief. 

5 Prisoner's Dilemma 

We now apply the results from previous chapters to the Prisoner's Dilemma, 
thus expanding Example 13.21 

First we will observe the following regarding a larger class of games: 

Lemma 5.1 Assume player i has only two actions and Ti. Then g] = 1, 
for u i Ai, gl{uj) = gf{uj) = 1, and for u E Ai, gf{uj) = fi{uj) < gf{uj). 

Proof: Hi = 0, and therefore gl = 1. Because there are only two actions, 
Aj = |w e I j^^i^ - iui{ai, Tj) + Ui{a)j^^^^^) > o|, and therefore if a; ^ 
Aj then from the definition of gf it follows that gf{uj) = 1. If G Aj, it follows 
that gf{uj) = fi{u}). In this case the inequality gf{uj) < gf{uj) can be checked 
arithmetically, but one can observe that if w G Aj then player i does not 
profit by deviating when the profile r* is played, therefore the deviation that 
leads to gf is more profitable for him then the deviation that leads to gf. 

Because A^ = |w G 1] | j^j^ - {ui{ai,Tj) + Ui{cr)j^^^) > o|, if w ^ 
Ai we have jzj^ - {u^{(Ti,Tj) + Mi(o-)iri^) < or Uiij) - Ui{ai,Tj){l - 
Xiiu)) — Ui{a)\i{uj) < 0, which implies that Hf = 0. □ 

Corollary 5.2 1. Let Ci C Aj for i = 1,2. If both players have only 
two actions, then to verify that r]*{B({Df{Ci n C2)), B({Df{Ci n C2))) 
is a Bayesian equilibrium, one only needs to verify that Pi{Bj{D^{Ci H 
C2)) I uj) < f,{uj) for every 00 e A,\ B({Df{Ci n C^)) fori = 1,2. 

2. r]*{B({D^{A)), B({Df (A))) is an equilibrium. 
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Note that in the last case both players cooperate whenever the fact that both 

players have high enough discount factor is a common- /-belief. 

Proof: 

1. This follows from Theorem 13. 3[ Lemma [4.5( 1) and Lemma [5.11 

2. Assume there exists u* E Ai \ B({D^{A)) such that Pi(5((D-^(A) | 
CO*) > fi{uj*). Denote Ki := B({Df{A) U {oo*} and K2 := B[{D^{A). 
From the assumption, Pi{Kj \ u) > fi{uj) for every u E Ki for i = 1,2, 
in contradiction to the maximality property of D{ (Aj, Aj) = B({D^{A) 
as shown in Lemma [4.5( 1). □ 

Remark 5.3 If we define fi to he at least 1 outside Aj (see Remark \3.d{ 3)). 
Corrolary \5.2\ holds for every game which satisfies fi > gi for i = 1,2. 

It is easy to calculate that for Prisoner's Dilemma with the payoffs as in 
Example 13. 2[ fi{uj) = ^2\ '(t) ■ Therefore, from the last corollary we have: 

Corollary 5.4 Let G be the repeated Prisoner's Dilemma where each player 
has incomplete information regarding the other player's discount factor. De- 
fine fiiuj) = ''2\ '(P) • ^^6^ J'^^'^ ^/ strategies where each player plays as 
follows: If you f -believe that it is a common- f -belief that Ai, A2 > \, play the 
grim-trigger course of action. Otherwise, always defect, is an equilibrium. 

Therefore, both players will cooperate when Ai, A2 > | is a common- /-belief. 

Note that there cannot be larger events than B{[D^ [A)), B^i^D^ [A)) such 
that the strategy profile 77* is an equilibrium, but it may be an equilibrium 
with smaller ones. See next section for examples. 
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6 Examples 



In all the examples in this section, Q C [0, 1)^, and we interpret the coordi- 
nates of w G as the players' discount factors, i.e. X{ijj) = (Xi^u), A2(w)) = 
u. In all the examples each player knows his own discount factor (i.e., player 
i knows coordinate i of u), and their belief regarding the other player's dis- 
count factor depends only on their own discount factor. 

In examples 16. m6.4[ G is the repeated Prisoner's Dilemma, with different 
information structures. In these examples corollaries 15.21 and 15.41 hold. 

In the following example S/ {D^{A)) ^ 0, and therefore we get the equi- 
librium of Corollary 15.41 but we can also get an equilibrium with smaller 
sets. 

Example 6.1 LetQ = {1/4,1/2,3/4}^, and each player assumes a uniform 
distribution on the other player's discount factor (regardless of his own dis- 
count factor). Equivalently, there is a common-prior of uniform distribution 
on Vt. 



Because A? = = 1/3, it follows that Ai = {1/2, 3/4} x {1/4, 1/2, 3/4} 
and A2 = {1/4,1/2,3/4} x {1/2,3/4}. Therefore we get that for every u, 
Pi(A2 I u) = 2/3. Smce /i(3/4) < /i(l/2) = = 1/2 < 2/3 ti;e get 

dI'^ {Ai,A2) = Ai. Similarly, {k2Ai) = and therefore B({Df (A)) = 
D({Ai, A2) = Ai. From Corollary \5.4\ we get that 



Set Ci = {3/4} X {1/4,1/2,3/4} and C2 = {1/4,1/2,3/4} x {3/4}, 
then Pi{C2 I oj) = 1/3 for every u. Since /i(3/4) = 1/6 > 1/3, we get 




defines a Bayesian equilibrium. 
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Dl'f{Ci,C2) = Ci. Similarly, Dl'f{C2,Ci) = C2 ,and therefore B{{D^{Ci n 
C2)) = Dl{Ci,Cj) = Ci. To use Corollary \5.lY l) we need to verify that 
Pi{Cj I (jj) < fi{uj) for every u E Ai\Ci. If u E Ai \ Ci then Xi{u) = 1/2, 
and indeed in Pi{Cj | w) = 1/3 < 1/2 = /j(l/2). Therefore, 



also defines a Bayesian equilibrium. 

In the following example i?/ {Df{A)) ^ 0, and therefore we get the equi- 
librium of Corollary I5.4[ but there are no smaller non-empty sets that define 
a Bayesian equilibrium. 

Example 6.2 Let Vt and A he as in the Example \6.1\ hut change the heliefs 
as follows for i = 1,2: If Xi = 3/4, player i helieves that Xj = 3/4, if 
Xi = 1/2 he helieves that Xj = 1/2 with prohahility 1/3 and that Xj = 3/4 
with prohahility 2/3, and if Xi = 1/4 he helieves that Xj is uniform distrihuted 
over {1/4, 1/2,3/4}. Thus, each player helieves that the other player has a 
discount factor at least as high as his own. 

Aj is the same as in Example \6.1\ and B{{D-f{A)) = Aj so that 



defines a Bayesian equilihrium. 

If we take Ci as in ExamplelMwe still get B( {Df{Ci H C2)) = Ci, hut 




T* A, = 3/4, 

ai A, = 1/4,1/2, 




now Pi{Cj I oj) = 2/3 > 1/2 = fii^uj) for u E Ai \ Ci. Therefore 
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does not define a Bayesian equilibrium. Similarly, 'r]*{Ki, K2) is not a 
Bayesian equilibrium for any non-empty i-measurable events Ki <Z (a 
strict subset), z = 1,2. 

In the following example S/ (D-^(A)) = 0, and therefore there is no 
Bayesian equilibrium with cooperation of the type described here. 

Example 6.3 Let f2 = {1/4,2/5}^, and X{lo) = (Ai(cj), A2(w)) = uj. Each 
player knows his own discount factor, and if Xi = 2/5 player i believes that 
Xj = 1/4 with probability higher than 1 — /j(2/5) = 1/4 (the rest of the beliefs 
are irrelevant). Here Aj = {Aj(-) = 2/5}, and so Pi{Aj \ u) < 1/4 = fi{uj) 
for CO G A,. Therefore D]'f{Ki,Kj) = ^ and B^{D^{A)) = D{{A,,Aj) = 0, 
so that we do not get an equilibrium with cooperation of the type rj* (because 
D{ (Aj, Aj) is the largest set where such an equilibrium is possible, and here 
it is empty, we cannot get it for any set). Note that according to Corollary 
\5.S\ rj*iB{{DU h)). bUdU A))) is an equilibrium, but in this case, because 
D^{A) = 0, it's the trivial equilibrium in which the players always defect in 
every state of the world u. 

In the following example B{ {Df{A)) ^ but B{ {Df{A)) ^ A^, as was in 
the previous examples whenever B{ (D-^(A)) was non-empty. 

Example 6.4 Let Q = (0, 1)^, and assume that each player believes that the 
other player's discount factor is uniformly distributed, regardless of his own 
discount factor. S is the Borel a-algebra over (0,1)^. Si contains the sets 
B X (0, 1), for every Borel set B C (0, 1), and S2 contains the sets (0, 1) x B, 
for every Borel set B C (0, 1). 

Here Ai = [1/3, 1) x (0, 1) and A2 = (0, 1) x [1/3, 1). We will see that 
B({D^{A)) = [1/2,1) X (0,1), i.e., a strict subset of Ai (and an analog 
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statement holds for player 2): because Pi(A2 | u) = 2/3 for every uj, and 
/i(3/7) = 2/3 we deduce that ^{'^(Ai.As) = [3/7,1) x (0,1). Therefore 
Pi{D2^ I u) = 4/7 for every u, and because /i(7/15) = 4/7 we deduce 
that dI'^{Ai,A2) = [7/15,1) x (0,1). We continue the same way and de- 
duce that for every k there is n such that D^'-^(Ai, A2) = [2^+!' ^) ^ ■'-)•' 
PiiDl \co) = 



2^ for every u, and because fi (^ 2i2^n+i)+i ) 



1 



2n+l 



we 



deduce that D^^^'-^ {Ai, A 



2n+l 



2(2n+l)+l ' 



(0,1). Therefore B{{Df {A)) 



D((Ai,A2) = {^^y^D{^ = [1/2,1) X (0,1). One can easily check that indeed 
Pi{B^ {D^iA)) I to) > fiico) for every tu G B({Df{A)) fori = 1,2. Note that 
here the construction of D( requires an infinite number of steps - for every 
k we have D^^^'^ ^ D^'^ . 

From Corollary \5.4\ we deduce that 



r* A,>l/2, 
ai Xi < 1/2, 



defines a Bayesian equilibrium. 

In examples 16.5116.61 G is the following repeated game: 





D 


C 




D 


1,1 


4,0 


■,o 


C 


0,4 


3,3 


a 


N 


0,- 


a, ■ 





with a > 4 (payoffs not indicated can be arbitrary). Here a = {D,D), 
T = (C, C), and = A2 = ^zf- these examples Theorem 14.61 holds, with 
W) = afiuj) = g] = ^ and gf{uj) = jj^^^^y 

In the following example i?/ (-D-^(A)) 7^ for z = 1, 2 but these sets does 
not satisfy the conditions in Theorem 14.61 (or equivalently those in Theorem 
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13. 3p . Moreover, we prove that in this example there are no non-empty Ki, K2 
such that ri*{Ki,K2) is a Bayesian equihbrium. 



Example 6.5 Let a = 6, the belief space as in Example \ 6.4\ Here Ai = 
[3/5,1) X (0,1) and A2 = (0,1) x [3/5,1), and because fi{u) < /.(3/5) = 
1/3 < 1 — 3/5 = Pj(Aj I oj) for every uj G Aj, D\{ki,k.^ = Aj and therefore 
D{{Ai,Aj) = Ai. But, for co ^ A„ Pi{\, \ u) = 2/5 > 1/3 = gj > g,{u) 
and therefore the second condition of Proposition \3.3\ does not hold, and 
7]*{B{{Df [A), Bl{Df [A)))) does not define a Bayesian equilibrium, unlike 
previous examples. 

Now we prove that there are no non-empty Ki, K2 such that 7]*{Ki, K2) is 
a Bayesian equilibrium. Suppose that ri*{Ki,K2) is a Bayesian equilibrium, 
and that Ki,K2 are non-empty cooperation events. Denote, for i = 1,2, 
X* := mi{Xi{u)) I u G Ki}. Since Ki C Aj, A* > 3/5. Note that since the 
beliefs are derived from the uniform distribution, Pi{Kj \ u) is independent of 
the state of the world u, so we denote it by Pi{Kj). From the first inequality 
of Theorem \3.3[ we have Pi{Kj) > fi{uj) for every u G Ki, and since fi is 
continuous, Pi{Kj) > /j(A*). We now argue that if \i{u) > A*, then u G Ki. 
Otherwise we have, from the second inequality of Theorem \3.3\ that Pi{Kj) < 
9i{(^) < gf{^) = M^)- But f,{uj) < fi{X*) < Pi{Kj), since \i{uj) > A*, zn 
contradiction. Therefore we have that Pi{Kj) = 1 — A*. Next, we argue that 
Pi{Kj) = /i(A*). Otherwise Pi{Kj) > fi{\*i), and there is a state of the world 
G such that Pi{Kj) > fi{uj) > /i(A*). From the definition of X*, we have 
00 ^ Ki, but then we should have Pi{Kj) < gf{uj) = fi{oj) in contradiction. 
We conclude that 1 — A* = /i(A*) for i = 1,2. Because fi{Xi) = we 
have that = 1 - A* = 2A*(1 - X^), or equivalently, XIX2 = 1/4, in 
contradiction to X* > 3/5 for i = 1,2. 
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In the following example -B/ (D-^(A)) 7^ and does not satisfy the condi- 
tion in Theorem I4.6[ but for a smaller non-empty set, the conditions hold, 
and therefore define a Bayesian equilibrium. 



Example 6.6 Let a = 5, and Q = {1/4, 1/2,3/4}^ with a common-prior of 
uniform distribution. Here = 1/2 and therefore Aj = {Aj(-) > 1/2}. As 
in the Example \6.5\ B{{D^{A)) = D{ (Aj, Aj) = Aj but the second condition 
of Proposition ISTSi does not hold because of gj: Pi(A2 | (1/4, A2)) = 2/3 > 



For Ci = {Xi = 3/4}, we get Dj {Ci, Cj) = Ci and the second condition of 
Proposition \3. 3\ does hold (this can easily be verified by calculation) . Therefore 



In this section, we generalize the results from the previous chapters, for the 
case of e-equilibria. 

First, we prove the following conditions for Bayesian e-equilibrium with 
cooperation, similar to Theorem 13.31 

Theorem 7.1 Let e > 0. In the game G, the strategy profile ■r]*{Ki,K2) = 
{rjl{Ki) , ri2{K2)) , such that Ki C Ai and K2 C A2, is an e-equilibrium for 
every u E Q if and only if, for i = 1,2, 




defines a Bayesian equilibrium. 



7 e-Equilibria 



1. Pi{Kj I oj) > fl{<jj) for every u E Kj 
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2. Pi{Kj I u) < glioj) for every uj ^ Ki. 

fl and gl are i-measurahle functions that will he described in the proof, and 
they tend to fi and Qi when e tends to 0. 

Proof: The proof is similar to the proof of Theorem I3.3[ with the same 
options for deviation in each case, and the payoffs are the same, only here we 
assume 'yi{T]*{Ki, K2) \ u) > 'yi{-,r]*{Kj) \ oo) - e instead of 'yi{r]*{Ki, K2) \ 



Case 1: a; G Ki. 

• Deviation to a'** — in this case ■yi{ri*{Ki, K2) \ u) > 'yi{a'** ,r]*{Kj) \ 
w) - e trivially holds since ■yi{T]*{Ki, K2) \ u) > •ji^a'** ,r]*{Kj) \ u) for 
every w G Aj. 

• Deviation to a'* — here instead of inequality ([T]) we get 




+ (1 - PiiKj I uj)){ui{Ti,aj) - Mi((T-,crj)) > -e-(5) 



or equivalently Pi{Kj \ u) > fl{<jj) where //(w) is defined by: 




where Fi 



{a[ I M»(r,,aj) < Ui{a[,aj)}. 



Case 2: uj ^ Ki. 



• Deviation to a'^ 



here instead of inequality ([2]) we get 



Pi{Kj I uj){{ui{auTj) -Ui{a[,Tj)) 



(n,(cr) - U,(o--,crj))) + 

+ {ui{a) -Ui{a[,aj) > -e. 
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or equivalently Pi{Kj \ u) < g^, where 



g- := mm 



and Hi := {cr,' ^ ai,Ti \ Ui{ai,Tj) < Ui{a'i,Tj)}. 
Deviation to — here instead of inequahty ([3]) we get 
Pi{Kj I uj) \ ui{cTi, Tj) + Ui{a) (^Y^X^ 



1 - Xiioo) 

-{ui{a) - Ui{Ti, cTj))) + +{ui{a) - Ui{Ti, aj)) > -e, 



or equivalently Pi{Kj \ u) < gf^i^u)) where 



whenever Ui{ai,Tj) + u,{a) (t^x|^) < (l^^Jy) and gf'{uj) := 1 oth- 
erwise. 

• Deviation to a'J^ — here instead of inequality dl]) we get 

Pi{Kj I u) {ui{Ti, cxj) - Ui{a) + Ui{ai, tj) - ni(r) + 

+ (Mi(cr) - Mi(o-., Tj)){\i{uj))) + (Ui(cr) - Ui^Ti, aj)) > -e, 

or equivalently Pi{Kj \ u) < gf^^uj) where 
and 

Hf{u) := {(J- 7^ Ti I Mi(r) - Mj(cTi, Tj) + (Mi(cr., Tj) - (Mi((T)))Aj(w) < 0} . 

Set gi{ijj) := minj^fj?^", gf''{ijj),gf''{u)}. The proof that and (y^l are i-measurable 
is the same as in the proof of Theorem I3.3[ □ 
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Remark 7.2 Note that the conditions given in this theorem are under the 
assumption that Ki C Ai and K2 C A2, and therefore the theorem does not 
describe necessary conditions for ri*{Ki, K2) to be an e-equilibrium. For ex- 
ample, define \^ := min |Aj | — {ui{a^, tj) + Ui{a)j^^) > — e Vcr^ 7^ rj| 
and := {u E Q \ Xi{uj) > i = 1,2}. Then, when the information 
regarding the discount factors is complete, t]*{A^,A'^) is an e-equilibrium, but 
generally A*^ ^ Aj for i = 1, 2. 

Corollary 7.3 Theorem \4.6\ holds for e-equilibrium with the following ad- 
justments: First, instead of f and g there should be and g^; and second, 
in the second part of the theorem, the assumption that Ki C Aj for i = 1,2 
should be added. 

Proof: The proof is exactly like the proof of Theorem 14.61 □ 

The following lemma shows that to /'^-believe in an event or to 1 — g"^- 
believe in an event are weaker demands than to p-believe in an event, for 
p < 1 sufficiently large. This property will be used in Section [HI 

Lemma 7.4 Let Ki C Aj for i = 1,2. For e > small enough, there exist 
constants Mi,Ni > such that < (w) < 1 — e/Mj for every u G Ki and 
gliuj) > e/Ni for every u ^ Ki. 

Proof: Let a'- be player i's best response to tj in F, and a"' the worst 
response. It follows, from the definition of that 



Denote Mj := 2(Mj(cr) — Uj(rj,o"j)). There exist Xj > A° such that, for every 



^« e < M,/2. Therefore, for every 




e 
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uo eKi such that \i{uj) G [A°, A,^], //(w) < 1 - t/Mi. If Xi{uj) > X}, we get 
that 



1-A.M l-AOy* ""^^"^^ 1-A^ 



> (Mi(r) - Ui{a)) ( - ) =: M. 



. 1 - Al 1 - A^ 

and therefore fl{uj) < m'+mI/2 ■ ^'^^ ^ small enough we will get f[{uj) < 
1—e/Mi for every u G Ki. flioj) > for every u E Ki ii e < Ui{a) — Ui{Ti, aj). 
From the definitions of gl"^, gf'^ and g^'^ we get that, for every u ^ K^, 

u ■ ^ 

g^ > mm 



and 



Ui[a) - Ui[Ti,aj) 



Therefore gi{uj) > e/Ni for every u ^ Ki. □ 

Next, we generalize the results of Section |5] to e-equilibria in games where 
both players have only two actions. 

The following lemma is the e-equilibrium equivalent of Lemma 15.11 

Lemma 7.5 Assume player i has only two actions a-i and Ti. Then, for every 
e > 0, gl'^ = 1, foru ^ ki, g^'^u), gf{uj) > 1, and for u G Ai, gi'\uj) > fl{u) 
and g}\uj) > gl\uj)- 

Proof: All the results, except gf^'^ioj) > gf''^{uj) for u G Aj, follow from 
Lemma \5.1\ since gi'^ > gi for k = 1,2,3 and fi > fl- The same reasoning 
that shows gfioj) > gf{uj) for uj G Aj, gl^'^iu) > gf^{u}). □ 

The following corollary is the e-equilibrium equivalent of Corollary 15.21 
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Corollary 7.6 1. LetCi C Aj fori = 1,2. If both players have only two 
actions, then to verify that r]*{B{\Df\C 1^02)), Bi\Df\C 1^02)) is 
a Bayesian e- equilibrium, it is sufficient to verify that Pi{Bj {Df\Ci n 
C2)) I uj) < fl{uj) for every 00 E K\ Bf{Df\Ci n C2)) for i = l,2. 

2. r]*{Bf{Df\A)),B[{Df\A))) is an e- equilibrium. 

Proof: 

1. This follows from Theorem 17. H Lemma [4.5( 1) and Lemma [7.51 

2. The proof is similar to the proof of Corollary 15.2( 2). only now it follows 
from Corollary 17.6( 1) instead from Corollary 15.2( 1). □ 

8 "Almost" Complete Information 

In this section we look at belief spaces that are "almost" complete, in sev- 
eral senses, and see whether the cooperation events when the information is 
almost complete are close to the cooperation events when the information is 
complete. 

8.1 Knowing Approximately the Discount Factors 

The first sense of "almost" complete information we consider, is when each 
player knows the other's discount factor, up to a range of e > 0. That is, 
player i knows his own discount factor, and is given a signal x so he knows 
that player j's discount factor is between x — e and x + e. As the following 
example shows, in this case the cooperation events are significantly different 
from the complete information cooperation events, even in the repeated Pris- 
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oner's Dilemma, regardless of e, and even if we consider only ^-equilibrium 
for some 6 > 0. 

Example 8.1 Let G be the repeated Prisoner's Dilemma. Let Q = (0, 1)^, S 
is the Borel a-algehra, and as in previous examples, X{uj) = (Ai(a;), A2(w)) = 
u for every u E Q. Each player knows his own discount factor, and believes 
that the other's discount factor is within e > of his own; if Xi{uj) = x, 
then player i believes that Xj is uniformly distributed in {x — e,x + e) ( with 
cutoffs if one side exceeds or 1). Again, Ai = [1/3, 1) x (0, 1) and A2 = 
(0, 1) X [1/3, 1). We will show that B({Df{A)) = [1/2, 1) x (0, 1) regardless 
of e (fore small enough): 

Let 6 > 0. Because fi is a continuous monotonically- decreasing function, 
and because /i(l/2) = (1/2), there exists 6' > such that /j(l/2 — 5') = 1/2 + 
6. Therefore, any state of the world u such that Xi{u)) < min{l/3 + 2e5, 1/2 — 
6'} is not in D]'^ {Ki, Aj). That is because for such states of the world u, one 
hasP,{Aj I to) = Pi{Xj > 1/3 | u) < 1/2+5 = fi{l/2-S') < fi{uj). Therefore, 
for some x^ > min{l/3 + 2e5, 1/2 - 5'}, Di'^(Ai, A2) = [x\ 1) x (0, 1), and 
similarly for player 2. Therefore, any state of the world u such that Xi{uj) < 
min{a;"^ + 2e5, 1/2 — 6'} is not in Df'^{Ai,Aj) by the same reasoning, and 
L'i'^(Ai,A2) = [x2,l) X (0,1) forx'^ > min{a;^ + 2e5, 1/2 - 5'}. We continue 
until we get that any state of the world u such that Xi{uj) < 1/2 — 6' is not in 
D{ (Aj, Aj). This is true for every 6 > 0. Because 6' tends to as 6 tends to 
0, D({Ai, A2) C [1/2, 1) X (0, 1) (similarly for player 2). The equality follows 
from Lemma \4^ 1 ), because [1/2,1) x (0,1) and (0,1) x [1/2,1) fulfill the 
inequality. 
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We get from Corollary 5.4 that 

, j r* k> 1/2, 
[(Ti Xi< 1/2, 

defines a Bayesian equilibrium and that there is not such an equilibrium with 
larger sets of cooperation, regardless of e. Therefore, under this profile the 
players cooperate when both discount factors are no less than 1/2, regardless 
of t, whereas is the complete information case they can cooperate whenever 
both discount factors are no less than 1/3 (see Example \3.^) . 

Note that this analysis does not change ifVL = (0, l)^n{(x, y) : \x—y\ < e}, 
which verifies that the true state of the world is within the support of the 
beliefs of the players. Also, if for every state of the world u player i believes 
that Xj is distributed in any non-atomic symmetric way around Xi{uj), the 
result still holds. 

For a Bayesian 6 -equilibrium we follow the same route, with f^ instead 
of f (see Corollary 1.0( 2)). Instead of a threshold of 1/2, we show that 



B(\d^\A)) = [^,1] X (0,1). That IS because f^^) = 1 - We 
get a lower threshold, which is still independent of e, and, for low enough 6, 
higher than 1/3. 

8.2 The True Discount Factors are Common-(l — e)- 
Belief in Most States of the World 

In this section we assume a common prior P over the states of the world. 

Definition 8.2 Let e,6 > 0. We say that the discount factors are almost 
complete information with respect to e and 6, if the set of states of the world 
in which the true discount factors are common-{l — t)-belief has probability 
at least 1 — 5. 
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From Theorem B in Mondrer and Samet (1989), we have that if the 
number of states of the world is finite, then in this case there are a strategy 
profile rj and an event Q' with probability at least (1 — 2e)(l — 5), such that: 
(a) i]{uj) = 77* (A, A){u) for every u G Q'; and (b) rj is an e'-equilibrium for e' > 
4Me (M is the maximum of the absolute value of the payoffs in the repeated 
game, taken over all the discount factors). In other words, there is an e'- 
equilibrium profile that coincides over a large set with the conditional-grim- 
trigger equilibrium of maximum cooperation in the complete information 
case. 

However, as Example 18.31 shows, this profile t] may not be a conditional- 
grim-trigger profile. It shows that there may be no conditional-grim-trigger 
e'-equilibria whatsoever, unless there are only two actions for each player (see 
Corollary E5]). 

Example 8.3 Let G be the game as in Example \6.6[ Let the information 
structure be as follows: each player has two possible discount factors. That 
is, 

S = {{H,, H,), {H,, L2), (Li, H2), (Li, L2)}, 

where Hi is the higher discount factor of player i, and the lower one. 
Assume Li < 1/3 < X'^ < Hi for i = 1,2. In every state of nature, each 
player gets a signal regarding the other player's discount factor, which may 
be either hi or li, that is Q = S x {hi,li} x {/i2,^2}; the first two coordinates 
are the discount factors, and the last two are the signals. Each player knows 
only his discount factor and his signal. 

The common prior on Q is as follows: the state of nature is chosen according 
to the distribution 

(1 - 6){H,, H2), ^(Li, L2), ^(Li, H2), ^(ifi, L2) . 
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In every state of nature, there is a probability 1 — e that both the signals are 
correct, and the other three states of the world that correspond to this state 
of nature have equal probability e/3. For example, if{Hi,H2) is chosen, the 
distribution over the states of the world is 

"(1 - e)(//i, H,, h, h), H,, h, h2), H2, h, k), H2, h, k)] . 

Assume that 5 <^ e. 

We now calculate the beliefs of player 1, up to normalization, that is, the sum 
of the coefficients in the following equations may not be 1. From symmetry, 
the beliefs of player 2 are similar. 

"((1 - - 5))iH^, H2, h, h2), ^(//i, H2, /ii, ^2) , 
f (-f^i, L2, hi, /i2), ^{Hi, L2, hi, I2)] , 

Pi{{Li,.,h,.))^ 

^"^-^{Li, L2, h, h), ^{Li, L2, li, h2) , 
^(Li, H2, li, I2), ^(-^1) H2, h, /i2)] , 

Pl{{Hi,;li,-))^ 

'^{Hi, H2, h, h2), ^-^{Hi, H2, h, I2), 
^iHi, L2, h, I2), {Hi, L2, h, /i2) , 

and 

Pl{{Li,;hi,-)) = 

[^(-^1) L2, hi, I2), ^{Li, L2, hi, h2), 

'-i{Li,H2,hi,h2),^-^{Li,H2,hi,l2) . 

Note that for ou — {Hi, H2,hi,h2), which happens with probability (1 — e)(l — 
S), the true state of nature is a common-{l — e)-belief, and therefore the 
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discount factors are almost complete information with respect to e and e + S, 
according to Definition \8.'A 

We now prove that there is no conditional-grim-trigger e' -equilibrium for 
small enough e' , independent of t and 5. Let e' he such that gfiLi) < 1 and 
ff {Hi) > for i = 1,2. These inequalities are satisfied by e' low enough, 
since (a) lim^/^o //' = ft, lim.'^ofi'l' = Qi, and (b) fi{Hi) > and gi{Li) < 1 
because < A° < Hi. Note that since Lj < A° < Hi for i = 1,2, we have 
Ai = {{Hi, ■, ■, ■)} and A2 = {(■, H2, ■, ■)}. Assume also that for i = 1,2, e' < 
1 — 2Li/ {1 — Li) (we assumed 1/3 > Li so 1 > 2Lj/(l — Li) ). In this case In- 
equality ^ does not hold for a' = D, for any Pi{Kj \ uj). Suppose rj*{Ki, K2) 
is an e'- equilibrium profile with non-trivial cooperation events. Since In this 
case Inequality ^) does not hold for player i when his discount factor is Li, we 
have Ki C Aj for i = 1,2, and so the conditions of Theorem \7.1\ holds. Sup- 
pose that {{■, H2, -,12)} ^ K2. ThenPi{K2 \ {L^, H2, hi,^)) = l-t>gi{L{) 
fort low enough, which is a contradiction since {Li, H2, hi,l2) ^ Ki. There- 
fore K2 = {(■, 7^2, ■) ^2)}; an by a similar argument Ki = {{Hi,-, hi,-)}. 
But then we have Pi{K2\{Hi, H2, hi,l2)) < e < fl' {Hi) for low enough e, 
in contradiction to ri*{Ki, K2) being an e' -equilibrium. Therefore there is 
no conditional-grim-trigger e' -equilibrium profile with non-trivial cooperation 
events. 

The following lemma is a result of this concept of almost complete infor- 
mation: 

Lemma 8.4 Set e > and S > 0, and assume that the discount factors are 
almost complete information with respect to e and S. Then P{A\D^^'^{A)) < 
6. 
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Proof: Denote K = {u E Q \ X{uj) is not a common-(l — e)-belief in u}. 
From our assumption P{K) < 6. Since A \ D^^'^{A) C K, we have P(A \ 
D^-'{A)) < 6. □ 

In the case where each player has two actions, Lemma 18.41 imphes that 
there is an e'-equihbrium with cooperation events close to A: 

Corollary 8.5 Suppose each player has two actions. Let e > and 6 > 
and M := 2ma.Xi=i^2{ui{cr)—Ui{Ti,aj)). Assume that the discount factors are 
almost complete information with respect to e and 5. Then, for every e' > Me, 
the strategy profile ri*{Bf {Df" {A)),B[ {Df" (A))) is an e' -equilibrium, and 
P{A\Df'\K)) <6. 

Proof: From Corollary EM2) we have that r/*(5f (A)), 5f '(D^''(A))) 
is an e'-equilibrium. From Lemma [7.41 we have that ff{uj) < 1 — e for every 
u e B[' {Df^K)). Therefore D^'^K) C Df\k) C A, so from Lemma EH 
we have that P(A \ Df^K)) <5.U 

8.3 Each Player (1 — e)-Believes that a State of Nature 
is Common-(l — e)-Behef 

In this section we assume that the information is almost complete in a dif- 
ferent sense: 

Definition 8.6 Let e > 0. We say that the discount factors are almost 
complete information with respect to e, if for every state of the world u, each 
player {1 — e)- believes in u that some state of nature is common- {1 — e)- belief 
in uj. 

We show that when the discount factors are almost complete information 
with respect to e, according to Definition 18. 6[ there is a simple conditional- 
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grim-trigger profile, which is an e'-equihbrium for e' > Me (M > is a con- 
stant which depends only on the game and not on the information structure). 
While this assumption can hold without the information structure derived 
from a common prior on the state of the world, we show that if it does, 
this concept of almost complete information is stronger than the concept in 
Definition 18.21 Also, under this assumption (with the common prior), in the 
simple conditional-grim-trigger e'-equilibrium mentioned above the players 
cooperate in all the states of the world in which there is a cooperation un- 
der the conditional-grim-trigger equilibrium of maximum cooperation in the 
complete information case, ri*{A,A), but for a set with small probability. In 
other words, under a stronger concept of almost complete information, we 
have a result which is quite similar to the one in Theorem B of Mondrer and 
Samet (1989), but with a conditional-grim-trigger profile which is defined 
explicitly for every state of the world. 

Proposition 8.7 Let e > 0. Assume that the discount factors are almost 
complete information with respect to e, according to Definition \8.6i . Then, the 
strategy profile r]*{Bl^^{D^~^(A)), B2^'^{D^^'^(A))) is an e'-equilibrium, for 
every e' > M e, where M > is a constant, independent of the information 
structure and of e. 

Proof: Denote Ki := Bl'^D^-^A)). Note that Ki C for i = 1, 2. From 
Lemma [7.41 there is an M > such that for every e' > Me and i = 1, 2, we 
have fi'{ui) < 1 — e for every u G and gf (uj) > e for every u ^ K^. 

For every oo G Ki, we have Pi{Kj \ u) > Pi{D^-\A) \uj)>l-e> ff{uj). 
For every u ^ Ki, we have from our assumption that player i {1 — e)-believes 
in u that some state of nature is a common- (1 — e)-belief in u, but this state 
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of nature is not in A. Therefore, uj G B] ^[D^ '^[K'^)). Therefore we have 

Pi{Kj \ uj) = l- Pi{K;r \u) = l- Pi{Bj-'{D'-%A^)) I u) < 
< 1 - Pi{D^''{A') \ uj)<e< gfiuj). 

From Theorem 17. II we have that ri*{Ki, K2) is an e'-equihbrium. □ 

Remark 8.8 1. Under this strategy profile, both players will cooperate if 
and only if u e D^-'{A), since B]'\D^-\K)) C A^. 

2. D^-%K) may he empty, and then t]*{BI^'{D^-'{A)), Bl-'{D^-%A)) a 
trivial profile with no cooperation. 

Proposition 8.9 Let e > 0. Assume that (a) the information structure is 
derived from a common prior P; (b ) the discount factors are almost complete 
information with respect to e, according to Definition \8.6^ and (c) the number 
of state of nature is finite or countable. Then the discount factors are almost 
complete information with respect to e and 3e according to Definition \8.2[ 

Proof: Denote by K the set of states of the world u such that there is a 
state of nature which is a common- (1 — e)-behef in u. From assumption (b) 
we have that for every state of the world u, Pi{K \ u) > 1 — e. Therefore 
P{.K) > 1 — e. For every state of nature A, denote := {u eVL\ X{uj) = A}, 
and Q' = [j^{G^nD^-%G^)). Our goal is to show that P{n') > l-3e. This 
is shown in the proof of Theorem B of Mondrer and Samet (1989). □ 

From Propositions 18.71 and 18.41 Remark 18.81 and Lemma 18.41 we conclude 
the following, which is the generalization of Corollary 18.51 for games with 
more than two actions for each player: 

Corollary 8.10 Let e > 0. Assume that (a) the information structure is 
derived from a common prior P; (b) the discount factors are almost com- 
plete information with respect to t, according to Definition \8.6{ - and (c) the 
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number of state of nature is finite or countable. Then (A) the strategy profile 
r]*{Bl^''{D^^^{A)), Bl^^{D^^''{A)) is an e'- equilibrium, for every t' > Me, 
where M > is a constant, independent of the information structure and of 
t; (B) under this strategy profile, both players will cooperate if and only if 
uj e D^-'{A); and (C) P{A \ D^-'{A)) < 3e. 

Example 8.11 Let G, S and Q be as in Example \8.3[ The common prior is 
as follows: the state of nature is chosen uniformly. In every state of nature, 
there is a probability 1 — e that both the signals are correct, and the other 
three states of the world that correspond to this state of nature have equal 
probability e/3. For example, if {Hi,H2) was chosen, the distribution over 
the states of the world is 

We now calculate the beliefs of player 1. From symmetry, the beliefs of player 
2 are similar. 

P^{{H,,■,h^,■)) = 

[{1 — e)(ifi, H2, hi, /12), ^{Hi, H2, hi, I2) , f (-f^i, L2, hi, /12), f (-f^i, -^2, hi, l2)\ , 
Pi{{Li,-,li,-)) = 

[(1 — e)(Li, L2, h, I2), f (-^1, L2, h, h2) , f (-^^i, H2, h, h), f (-^i, H2, h, /i2)] , 

PliiHi,;li,-)) = 

[(1 — e){Hi, L2, h, h2), ^(Hi, H2, /i, ^2), ^{Hi, L2, h, h), ^{Hi, H2, h, /i2)] , 
and 

Pi{{Li,-,hi,-)) = 

[{1 - e)(Li, H2, hi, I2), f (Li, L2, hi, /is), f (Li, H2, hi, /la), f (Li, L2, hi, h)] • 
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Note that indeed, for every state of the world u, each player (1 — t) -believes 
in u that some state of nature is a common-{l — e) -belief in u, so the dis- 
count factors are almost complete information with respect to e, according to 
definition \8.6i . 

As in Example \8.3[ Ai = {{Hi, ■,■,■)} ^''^d A2 = {(■, H2, ■,■)}, and so A = 
{{Hi, H2, ■,■)}■ Therefore D'-' = {{Hi, H2, hi, h2)} , and so B\'\D^~\K)) = 
{{Hi,-, hi,-)} and Bl~'{D^-'{A)) = {{-,H2,-,h2)}. Therefore, from Corol- 
laru \8.1[h we conclude that rj*{{{Hi, -, hi, ■)}, {(-, H2, -, h2)}) is an e' -equilibrium 
for every e' > Me. Under this strategy profile each player cooperates if both 
his discount factor and his signal are high, and both players cooperate if the 
state of the world is {Hi, H2, hi, h2) ■ Indeed, as in Corollary \8.10[ we have 
that P{A \ {{Hi, H2, hi, h2)}) = e < 3e. 

Note that if Hi and H2 are high enough, we have Pi{A \ u) > fi{u)) > 
ff{uj) for every u e Ai, and therefore Bf {D^ {A)) = Bf {D^' (A)) = A^ 
fori = 1,2, and D^^ {A) = D^'' {A) = A. This implies that ?7*(Ai,A2) may 
be an equilibrium or an e' -equilibrium with larger cooperation events. But, 
if e is small enough, we have Pi(A2 | {Li, H2, h2,li)) > 1 — e > gl'^ > 



gl'" {{Li, H2,h2,li)), so the second condition of Theorem \7.1\ does not hold 
and r7*(Ai, A2) in not an e' -equilibrium. 



9 Generalizations 

In this section we generalize the results of the previous sections in several 
ways. First, we see that the main result (Theorem 13. 3p holds, with some 
adjustments, even when each player does not know his own discount factor. 
Second, we generalize Theorem 13.31 for general games with incomplete infor- 
mation, with respect to a course of action that is an equilibrium in every state 
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of the world (equivalent to a*), and a course of action that is an equilibrium 
only in some states of the world (equivalent to r*). Last, we analyze the case 
of repeated games with incomplete information on the discount factor with 
more than two players. 

9.1 The Results when One's Own Discount Factor is 
Unknown 

In this section we assume that each player does not know his own discount 
factor in every state of the world. In this case, most of the arguments in the 
proof of Theorem 13.31 still hold, with the following adjustments. In order to 
avoid measurability problems, we assume in this section that Q is finite or 
countable, S is the power set of Q, and Sj is generated by the "types" of 
player i. 

When calculating the expected payoffs, we cannot take Aj out of the 
expectation, since Xi{u)) is no longer known to player i given u. For example, 
instead of: 

= P^iK, I uj) (^) + (1 - ^K, I u)) {u,{n,a,)+u,{a)^) 
when u E Ki, the expected payoff is: 

+ (1 - P,{Kj I u)) (^Uiin, aj) + Ui{a)E, | w)) . 

Similarly, Aj is no longer the set {cu G f2 | Xi{u)) > A°}, which is no longer 
necessarily an z-measurable set. 

Rather := G | Va,' ^i(f^ - {ui{a[, tj) + u^{a)j^^) | w) > o|. That 
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is the set of states of the world in which player i believes he cannot profit by 

deviating from the profile r*. 

The main adjustment in Theorem 13.31 itself is that it is not necessary that 
C Aj. Indeed, this requirement was derived from the inequality: 

and because Xi > 0, it follows that either Pi{Kj | u;) = or w G A^. 

When one's own discount factor is unknown, this inequality becomes 

\0 ' - °' 

but here Ei ^Aj ^j^^ " i'^ii'^o '^i) + T^)) I > is weaker than a; G 
Aj. Therefore, there may be games such that there are events Ki ^ Aj 
such that T]*{Ki, K2) is a Bayesian equilibrium, but then the conditions are 
different from these of Theorem 13.31 — if w G -ft'j \ Aj, player i may have to 
/i-believe in Kj in the state of the world u, for a certain function h, since 
he has a profitable deviation from r* in u. This may be in in addition to 
/-believing in Kj in the state of the world u. See Example 19.31 

Under the assumption that Ki C Ai for i = 1,2, the rest of the proof 
does not change, and so we get the following result. 

Theorem 9.1 In the game G, the strategy profile r]*{Ki, K2) , such that 
Ki C Ai and K2 C A2, is a Bayesian equilibrium, if and only if, for i = 1,2, 

1. Pi{Kj I u) > fi{uj) for every u e Ki, 

2. Pi{Kj I u) < Qiiuj) for every uj ^ Ki. 

The functions fi and gi are the same as before, with the natural adjust- 
ments; for example, instead of: 

f iuj) ■= max ' p u,{a'^,aj)^u,{n,a,) 
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now we have: 

f (, ,\ ™„„ , u,{cr[,(Tj)-u,{T„crj) 

' " i?,(^-K(a^,r,)+«,(<7)^)|<^)+K(<7^,a,)-«.(r.,<7,)) 

Note also that the sets over which the maxima and minima are taken when 
calculating / and g may change according to the same adjustments. 

Remark 9.2 • Note that because now we assume that Ki C Aj for i = 
1,2, we can omit the assumption that Ti is not a best response to cTj, 
retaining only the weaker assumption that Ti is not in the support of (Xi. 

• Because of that, as mentioned in Remark \3.5\ (2). we may get that fi = 
0. This may also be the case even ifri is not a best response to crj (which 
was not the case in the original theorem, see Remark \ 3.0{ 2)). That 
happens if and only if the distribution \i{uj) such that Ei ^ytx" I ~ 
oo. 

• In this case, the functions fi and gi have a slightly different form if we 
take the payoffs as 7^ = — Aj) Yltli I ^) ('^'^^h the (1 — Aj) 
in the expectation). The only significant difference is that in this case 
the payoffs are bounded and therefore Remark 137^ 2) still holds. 

The following example shows that the assumption Ki C Aj for i = 1, 2 in 
Theorem 19 . 1 1 cannot be omitted — t]*{Ki, K2) may be a Bayesian equilibrium 
without this assumption, but then the conditions are quite different. 

Example 9.3 Consider the following game: 
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10,- 
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(payoffs not indicated can be arbitrary). Here a = {D,D), t = {C,C). 
Both players have the same discount factor, that can be either 4/5 or 2/5, 
so that S = {2/5,4/5}. Let the information structure be the following: Q = 
S X Si X S2, where Si = {Hi, Li, Xi} are the possible signals of player i. 
Denote the signal that player i receives by Sj. The first coordinate is the 
common discount factor in each state of the world, that is X{{a, Si, S2)) = a. 
Each player knows only his own signal, and his beliefs are as follows: If player 
i gets the signal Hi he believes that the state of the world is {4/5, Hi, H2): 
the discount factor is 4/5 and this fact is common belief. If player i gets the 
signal Li he believes that the state of the world is {2/5,Li,L2): the discount 
factor is 2/5 and this fact is common belief. If player i gets the signal Xi, 
he believes that the state of the world is {2/5, Xi, Lj) with probability p, and 
{A/5, Xi, Hj) with probability 1 —p, where 3/20 < p < 6/23. 

The best deviation from r* is to play N in the first stage and from 
the second stage on to play D. This deviation is profitable for player i if 
and only if Ei [ui{C,C)-^ - {ui{Ni,Cj) + Ui{D,D)-^) | w) > 0, that is if 
Ei I i^) > 7. Since p > 3/20, it follows that Aj = {si = Hi}. 

Denote Ki = {si = Hi or Si = Xi}. We prove that ri*{Ki, K2) is a 
Bayesian equilibrium, even though Ki ^ Aj. Because the game, the infor- 
mation structure and ri*{Ki, K2) are symmetric, it is sufficient to show that 
there is no profitable deviation for player 1. If player I's signal is Si = Hi, 
he believes that it is a common belief that A = 4/5, and that player 2 plays 
. Since in this case u G Aj, there is no profitable deviation from r* for 
player 1. If player 1 's signal is si = Li, he believes that player 2 always plays 
D, and therefore player 1 cannot profit from deviating from r]l{Ki) (which 
in this case is "always play D"). If player I's signal is Si = Xi, he plays 
Ti . Similar to the proof of Theorem \3.3l to prove that there is no profitable 
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deviation, two inequalities have to be satisfied: 

P^{K2 I oo)E, (^X - Ma[,C) + I io^ > 0, 

and 

P,{K, I u;)E, - C) + ^) | uj) + 

+ (1 - Pi(ir2 I a;))(0 - D)) > 0, 

/or a[ = D or (t'i — A'^. For a'^ — D, the first inequality holds since A > 1/3 
with probability 1 (it is the same inequality as in the prisoner's dilemma, 
where \\ = 1/3). The second inequality is equivalent to Pi{K2 \ cj)(8 — 
YP) — 1 > 0. For a'l = N, the first inequality holds since p < 6/23. The 
second inequality is equivalent to —Pi{K2 \ Ijj){9 + ^P) + 10 > 0. Therefore, 
we have that ri*{Ki, K2) is a Bayesian equilibrium if and only if Pi{K2 \ 
w)(8 - f - 1 > and 10 - Pi{K2 | w)(9 + f > 0. The first inequality 
holds for high enough Pi{K2 \ uj), which is similar to the first condition in 
Theorem \9.1\ but the second inequality only holds for low enough Pi{K2 \ oj), 
which is a significantly different condition (it is h-believing in K2 in addition 
to f -believing in K2). 

To complete the proof that ri*{Ki, K2) , observe that when Si = Xi, we 
have Pi{K2 \ u) = 1—p. Therefore 10 — Pi{K2 \ Ijj){9 + yP) > zs equivalent 
to (1 - p)(l - ^) + lOp > 0. Since p < 6/23, we have 1 - ^ > -g, so 
(1 - p){l - ^) + lOp > ^p - ^, and ^p - ^ > for every p > 3/20. 
Similarly, Pi{K2 \ a;)(8 — yp) — 1 > zs equivalent to (1 — p)(8 — yp) — 1 > 0, 
which holds for every p < 7/10. 

Our results regarding games with only two actions for each player does 
not hold in this more general case. Lemma 15.11 does not hold in this case: 
g] = 1> afi.^) = gK^) = l ^^i every u ^ A^, and gf{u) = fi{u) < gf{u) for 
every G Aj still hold, but gf{u}) needs not be 1 for every u ^ Aj. Therefore, 
Corollary 15.21 does not hold in this case. 
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9.2 General games 

In this subsection we expand Theorem 13.31 to a broader class of two-players 
Bayesian games (not only repeated games with incomplete information re- 
garding the discount factors). While up till now we assumed that the sets of 
actions and the payoff functions are independent in the state 

of the world, now it is not the case: for every u E Q, Ai{s{ijj)) is the set of 
the actions of player i in the state of the world u, and Ui{s{ijj)) is his payoff. 

Let G = {N, (S, iS),n, (Aj)jgAr, ('Ui)igAr) be a general two-player Bayesian 
game. To avoid measurability problems, assume that the states of the world 
are finite or countable. Assume that there is a course of action profile cr* = 
(o"j[',cr|) such that, when the information is complete, is an equilibrium for 
all states of nature, and that there is another course of action profile, r* = 
(r^*, T2) which is an equihbrium in only some states of nature. Suppose that 
the supports of r* and a* are disjoint in all states of the world, that is, it 
is discernable whether player i plays a* or r*. A strategy of player i is an 
z-measurable function that assigns each state of the world a course of action 
of player i. 

Let Aj C be the event "player i believes that he cannot benefit by 
deviating from the profile r*". That is, for every w G Aj and every course of 
action a,' of player i, Eiiuiij*) \ u) > Ei{ui{a[,T*) \ uj). 

Theorem 9.4 In the game G, there exist i -measurable functions < fiiQi < 
1,2 = 1, 2, such that if Ki C Ai and K2 C A2, the strategy profile r]*{Ki, K2) = 
{rjl{Ki) , ri2{K2)) , is a Bayesian equilibrium, if and only if, for i = 1,2, 

1. Pi{Kj I u) > fi{u) for every u e Ki, 

2. Pi{Kj I u) < Qiiuj) for every u ^ Ki. 
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Remark 9.5 There are several differences between the results of this theorem 
and Theorem VJ.'^ 

• Here we assume that Ki C Ai and K2 C A2 whereas in Theorem \3.3\ we 
showed it was necessary for ri*{Ki,K2) to be a Bayesian equilibrium. 
This weakened result allows us to drop the assumption that r* is not 
a best response to a*, and only requires that it is discernable whether 
player i plays a* or t* (disjoint supports)^ 

• We do not assume that the realized payoffs are observed, so that all a 
player knows is his an expected payoff based on his information on the 
states of nature. 

• We do not assume that the payoffs when t* is played are higher than 
when a* is played. 

Proof of Theorem 19. 4t Similar to the proof of Theorem 13. 3[ we have two 
cases. 

Case 1: CO E K^. 

The payoff of player i when ri*{Ki, K2) is played is 

If player i deviates to a course of action 7^ r*, then his payoff is 

= P,{Kj I uj)E,{ui{a[,T*) + P,{Kj \ uj))Ei{ui{a[,a*) \ uj). 

^Still, as before, if we do assume that t* is not a best response to cr*, we get fi>0, 
assuming that the payoffs in G are bounded. 
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Because rj*{Ki,K2) is a Bayesian equilibrium, ■ji{ri*) > 7i(cr-,r7*) for every 



Pi{Kj I u;){E,{u,{T*)-u,{a[,T;) \ cu) - E,{u,{t* , a*) - u,{ai, a*) \ cu)) + 

+Ei{u.,{T:,a*)-u.,{a[,a*) \co)>0, (6) 

for every cr^' 7^ r*. By assumption Ki C Ai, so that Eiiuiij*) — Ui{a[,T*) \ 
uj) > 0. If Ei{ui{T* , a*) -Ui{al,a*) \ u) > 0, then inequahty trivially 
holds. Otherwise, inequality (jS]) is equivalent to Pi{Kj \ u) > fi{uj), where 

, Ei{u^{a'i,a]) -Ui{T*,a*) \ u) 

and = {a,; | a*) | tj) < Ei{ui{a'^,a*) \ u)}. 

Case 2: UJ ^ Ki. 

The payoff of player i under ri*{Ki, K2) is 

= Pi{Kj I uj)Ei{ui{ai,Tj) I w) + (1 - Pi(irj- I w))Fi(Mi(a) | 00). 
If player i deviates to a course of action a'^ 7^ a*, then his payoff is 

-ii{a[,7]*{Kj) I uj) = 

= P,iK^ I u)E,{u,{a',,T;) I a;) + (1 - P^iK, \ uj))E,{u,{ala*) \ uj). 

Since t]*{Ki, K2) is a Bayesian equilibrium, 

%{r]*{Ki, K2) I w) > %{a'i,ri*{Kj) \ u) for every cr- 7^ cr*, or equivalently 

Pi{Kj I a;)(Ei(Mj((T*,r*) - Ui{cr[,T*) \ u) - Ei{ui{a*) - Ui{o[,o*) \ u)) + 

+Ei{ui{a*)-Ui{a[,a*) | > 0, (7) 

for every a[ 7^ cr*. Because a* is an equilibrium in every state of nature, 
Ei{ui{a)—Ui{a[,aj) > 0. Therefore, if Ei{ui{(Ti, tj) —Ui^a^, tj) > 0, inequality 
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([7j) trivially holds. Otherwise, inequality ([7]) is equivalent to Pi{Kj \ u) < 
Qiioj) where 

.^^ Ei{u,{a*)-Ui{a[,a*) \ uj) 

^'^"^^ ■ Ei{ui{a*) - Ui{al a*) \ u) + Ei{ui{a[, t*) - mia*, t*) \ 00) ' 

and Gi = {al \ Ei{ui{a*,T*) \ cu) < E,{u,{a[,T*)) \ u)]. □ 
9.3 More than Two Players 

In this section we show that while the same analysis can be used to derive 
necessary and sufficient conditions for conditional-grim-trigger equilibria in 
repeated games with incomplete information on the discount factors with 
more than two players, these conditions are much more complex than in two 
player games. We provide simple conditions, similar to those in previous sec- 
tions (i.e., /-believing in an event at a state of the world), which are sufficient 
for a conditional-grim-trigger strategy profile to be a Bayesian equilibrium, 
but are not necessary (Theorem 19. 6p . 

Let r = (A^, (y4j)jg7V5 (^j)ieAf) be a one-shot game, with = {1, 2, A^} 
the set of players. Let a = (ai, o"2, (Tat) be a mixed-strategies Nash equi- 
librium in r. Assume that the payments in another non-equilibrium pure 
action proffie r = {ti,T2, ...,tn), are higher than the equilibrium payments, 
that is Ui{T) > Ui{a) for every i & N. Also, assume that for every i & N, Ti 
is not in the support of dj. 

Let G = {N, (5, 5),n, (y4j)jg7v, (^j)jGAf) be the repeated game based on 
r, with incomplete information regarding the discount factors, similar to the 
one described in Section |2l where each player knows his own discount factor. 

Let r* and a* be defined as in the two-player casejfl Again, a* is an 

®When there are more than two players, r* is triggered whenever there is at least one 
player that deviate from r* . 
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equilibrium course of action regardless of the discount factors. Player i does 
not have a profitable deviation from r* if and only if Aj > A°, where 

A° := min | A, | ^ - {u.ial, r_,) + u^icr)^) > Va^ ^ r,| . 

As before, denote Ai :— {u e \ Xi{uj) > A°}. 

The conditional-grim-trigger strategy is defined in the same way as in the 
two-player case, only now there is a cooperation event for eack player in A^, 
so the strategy profile is 7]*{Ki, K2, Kn) = {7]*i{Ki),7]^{K2), ...,7]%{Kn)). 
The following theorem gives sufficient conditions that guarantee that the 
profile r)*{Ki, K2, Kn) is a Bayesian equilibrium. 

Theorem 9.6 Suppose that Ki C Aj, for every i & N. The strategy profile 
ri*{Ki, K2, Kj^) is a Bayesian equilibrium in the game G, if, for every 
i e A^, 

PiiOj^i^j I > /i(^) for every u e Ki, 
^- PiiUj^i I ^) < 9i{^) for every u ^ Ki. 
Where the functions fi and Qi are defined by: 

fi{LJ) := max 



and gi{uj) := mm{gl,g^{uj)}, where 



1 Ui(a) - Ui(a'i, a-i) 

g- := max 



T'ii^ruai Ui{a) - Uila'i, a-i) + 2Mj ' 

(1 - Xi{uj)){ui{a) - Ui{Ti, a_i)) 



(1 - Xi{u)){u,{a) - u,{t„ cr_,)) + 2Mi' 
and Mi := max{|Mj(ai, ...,aAr)| | % e Aj for j G A^}. 
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These conditions mean that in order for ri*{Ki, K2, K]\f) to be a Bayesian 
equihbrium, player i needs to /-believe in a the event fl^yj whenever he 
plays T*, and to (1 — g)-he[ieve in the event Uj^j whenever he plays a*. 
Note that when there are more then two players, these two events may not 
be the same. 

Proof: The proof follows the same lines as the proof of Theorem 13. 3[ For 
simplicity, we assume that = 3. The analysis for games with more than 
three players is similar. Without loss of generality, it is sufficient to prove 
that player 1 does not have a profitable deviation. 
Case 1: cu E Ki. 

Player I's payoff under the strategy profile ri*{Ki, K2,K^) is 

^,{r^*{K,,K2,K^)\uj) = 

P,{K,r^K,\uJ) (t^) + 

P,{K2 \ K, I u) (wi(ri, r2, ^3) + u,{a)^^) + 

(1 - P.iK^UK, I uj)) (u^in,a2,as)+u^ia)j^) . 

As in the proof of Theorem 13. 3[ we need to consider the following deviations: 

• Let a[ 7^ Ti, and define the course of action a'^* by: Player 1 plays Ti 
in the first stage, an if the action profile r was played in the first stage, 
player 1 plays a pure action a[ in stage 2 and ai afterwards. If the 
action profile r was not played in the first stage, player 1 plays cti from 
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the second stage onwards. If player 1 plays cr^**, his payoff is: 

7i(c7r,%*(i^2),^,;(i^3) |a;) = 

Pi{K,nKs\uj) (^xi,(r) + «iK,r2,T3)AiH+«i(a)g^) + 

P,(K, \ Ks I u) (u,(n, r2, as) + u,{a)j^) + 

(1 - P,{K, U I c^)) («i(ri, a2, as) + . 

Since r]*{Ki, K2, K^) is a Bayesian equilibrium, ^i{r]*{Ki, K2, K^) \ 
> -fi{a[**,r]^{K2),r)^{K2) | a;), or equivalently, 

P.iK^nKs I a;)AiH - KK, r2, ra) + ki(<7)^)) > 0, 

for every a[ 7^ Ti. Because Ai > 0, either Pi{K2 fl | cu) = or 
a; e Ai, which is a similar result to the two-player case. 

• Let a[ 7^ Ti, and define the course of action a'^ by: Player 1 plays a[ 
in the first stage and cxi afterwards. If player 1 plays a'l, his payoff is: 

7iK,772*(i^2),%*(i^3) = 

Pi{K2 n K3 I uj)ui{a[,r2, T3) + Pi(X2 \ i^s I uj)ui{a[,r2, (73)+ 
Pi(ir3 \ I w)miK, a2, r3 + (1 - Pi(ir2 U K3 \ Lj))ui{a[, cxa, £13) + 

Since r]*{Ki, K2, K^) is a Bayesian equilibrium, 7i(?7*(iri, 7^2, -f^s) | 
t^) > -fi{a[*,r)^{K2),r);{K3) \ uj), or equivalently, 

Pl(X2 \ K3 I a;)(rti(Ti,T2,C73) - lti((Ti, T2, (73)) + 

PiiK^ \ K2 I c<;)(Mi(ri,a2,r3) - aa, 0-3)) + 

(l-Pl(i^2Ui^3 I a;))(Mi(Ti, (72,(73) -MlK, (72,(73)) >0. (8) 
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This is the analogue of inequahty ([T]) in the two-players game, only here 
this condition is not equivalent to a simple /-belief type condition. 
However, the left hand side of inequality ([8]) is no less than 

2M^{l-P,{K2nK,\u)), 
and so it is a sufficient condition that 

2Mi{l-Pi{K2nKs I u)) > 0, 

for every a[ ti, which is equivalent to Ki C Ai and Pi{K2 fl K3 \ 
> fiiu). 

Case 2: u ^ Ki. 

Player I's payoff under the strategy profile rj*{Ki, K2, K3) is 

P,{K2 nKs\cu) r2, rg) + ni(a)^M^) + 

P,{K2\K, I u) (u,ia,,T2,as)+u^ia)j^) + 
P,{K, \ K2 I UJ) (72, rs) + u,{a)^^) + 

As in the proof of Theorem I3.3[ we need to consider the following deviations: 

• Deviation to a^*, for a[ 7^ ri, ai {a[ may be in the support of ui, if ai 
is not a pure action). The payoff is: 

^,{a[\r,l{K2),r,l{K,)\u) = 

Pi{K2nK3 I Uj)ui{a[,T2,T3) + Pi{K2\Ks I uj)ui{a[,T2,a3) + 
Pi{Ks \ K2 I as, Ta + (1 - Pi(ir2 U | uj))ui{a[, ^2, ^3) + 
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Since ri*{Ki, K2, K^) is a Bayesian equilibrium, 'yi{ri*{Ki, K2, K^) \ 
uj) > -fiia'*,r]*{K2),r]*{K3) \ u), or equivalently, 

Pl{K2 n K3 I Uj){ui{(Ti,T2,T3) - Ui{a[, T2, T3)) + 

Pi{K2 \ Ks I u){ui{ai,T2,a3) - Ui{a[,T2, cr-i)) + 

Pl{K3 \ K2 I Uj){Ui{ai, (T2, Tg) - Ui{a[, (T2, (Tg)) + 

(1 - Pi{K2 U K3 I a;))(ni(a2) - ui{a[, 0-2, ^3)) > 0. (9) 

This is the analogue of inequality (|2]) in the two-players game, only here 
this condition is not equivalent to a simple /-belief type condition. 
However, the left hand side of inequality ^ is no less than 

{l-Pi{K2UK3\ uj)){ui{<y) - m{a[, ^2, as)) - 2MiPi(K2 U i^s I cu), 

and so it is a sufficient condition that 

(1 - Pi{K2 U K3 I oj)){ui{a) - MiK, (T2, aa))- 
2MiPi{K2UK3 I w) > 0, 

for every a[ 7^ ri, ai, which is equivalent to Pi{K2 U K3 \ u) < g}. 

Deviation to r^. The payoff is: 

j,{T*,r]*{K2),v*3iK3)\u;) = 
P,iK2nK3\u^) (t^) + 
P,iK2 \ Ks I uj) (ni(ri, r2, ^3) + u,ia)j^) + 
PiiKs \K2\u) (^Mi(ri,a2,r3) +ui{a)j 



{l-Pi{K2UK3 I w)) (^Mi(ri,(T2,(T3) + ui(a)Y 



+ 



Since ri*{Ki, K2, K^) is a Bayesian equilibrium, 7i(?7*(fri, _ft'2? -^3) 
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> -fi{T*,r];{K2),r]*{K3) \ u), or equivalently, 

Pi(fr2 \ K3 I a;)(Mi(cri,r2,o-3) - r2, 0-3)) + 

Pi{K3 \ K2 I a;)(ni(ai,cr2,r3) - Ui(ri, 0-2, 0-3)) + 
(l-Pi(ir2Uir3 |c^))KM-Wi(ri,a2,a3)) >0.(10) 

This is the analogue of inequahty ([3]) in the two-players game, only here 
this condition is not equivalent to a simple /-belief type condition. 
However, the left hand side of inequality (fTOl) . multiplied by 1 — Ai(ci;), 
is no less than 

(1 - Pi(i^2 U Ks I C0))Ma) - Uiin, (T2, CTs)){l - AiH)- 

2M,Pi{K2UK3 I u), 

and so it is a sufficient condition that 

(1 - PiiK2 U Ks I co)){m{a) - uiin, (T2, a3))(l - Ai(a;))- 
2MiPi{K2VJ K3 I w) > 0, 

which is equivalent to Pi{K2 U \ oj) < gf{uj). 
• Deviation to erf, defined by: 

— Play Ti in the first stage. 

— If the profile r was played in the first stage, play a pure a[ 7^ n, 
and afterwards (Xi. 

— If the profile r was not played in the first stage, play cti from the 
second stage onwards. 
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The payoff is: 

7iK\r^2*(^2),%*(i^3) |u;) = 

(1 - P,iK, U Ks I u)) (u,in,a,, as) + u,ia)j^) . 

Since ri*{Ki, K2, K^) is a Bayesian equilibrium, 'yi{ri*{Ki, K2, K3) \ 
(^) > 7i(o-f ,^2(^2),'7|(^3) I w), or equivalently, 

Pi{K2 nK^lu) {ui{ai, r_i) - ui{t) + {ui{a) - Ui{a[, r_i))Ai(u;)) + 
PiiK2 \ K3 I uj){ui{cri,T2,a3) - Ui(ri, Tg, (Ts)) + 
PiiK^ \ K2 I u;)(Mi(ai,a2,r3) - Mi(ri, (T2, 0-3)) + 
{l-Pi{K2UK^\uj)){m{a)-m{ri,a2,a^))>{].{ll) 

This is the analogue of inequality (jl]) in the two-players game, only here 
this condition is not equivalent to a simple /-belief type condition. 
However, the left hand side of inequality f|TT]) is no less than 

(l-Pi(ir2UA^3 I uj)){ui{a) - Ui{T^,a2,a^))- 
2MiPi{K2 U I w)(l + AiH), 

and so it is a sufficient condition that 

{l-Pi{K2UK3 I a;))(ni((T) -nl(rl,(T2,(T3))- 
2MlPl(i^2 U I + Ai(a;)) > 0, 

which is a weaker condition than Pi{K2 U \ u) < gf{uj)- 

The proof that fi and Qi are z-measurable is the same as in the proof of 
Theorem 13 .Si □ 
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